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Abstract 

For a fixed integer r, consider the following random process. At each round, one is presented 
with r random edges from the edge set of the complete graph on n vertices, and is asked to choose 
one of them. The selected edges are collected into a graph, which thus grows at the rate of one 
edge per round. This is a natural generalization of what is known in the literature as an Achlioptas 
process (the original version has r = 2), which has been studied by many researchers, mainly in the 
context of delaying or accelerating the appearance of the giant component. 

In this paper, we investigate the small subgraph problem for Achlioptas processes. That is, 
given a fixed graph we study whether there is an online algorithm that substantially delays 
or accelerates a typical appearance of , compared to its threshold of appearance in the random 
graph G{n,M). It is easy to see that one cannot accelerate the appearance of any fixed graph 
by more than the constant factor r, so we concentrate on the task of avoiding H. We determine 
thresholds for the avoidance of all cycles Ct, cliques Kt, and complete bipartite graphs Kt.t, in 
every Achlioptas process with parameter r > 2. 

1 Introduction 

The standard Erdos-Renyi random graph model G{n, M) can be described as follows. Start with the 
empty graph on n vertices, and perform M rounds, adding one random edge to the graph at each 
round. For any monotone increasing graph property (such as containment of as a subgraph, say), 
it is natural to ask whether there is some value of M at which the probability of G{n, M) satisfying 
the property changes rapidly from nearly to nearly 1. More precisely, a function M*{n) is said to be 
a threshold for a property V if for any M(n) ^ M*(n), the random graph G{n,M) does not satisfy 
V whp, but for any M(n) ^ M*(n), the random graph G{n,M) satisfies V whp. Here, whp stands 
for with high probability, that is, with probability tending to 1 as n — > 00, and f{n) <^ g{n) means 
that f{n)/g{n) ^ as n — > 00. A classical result of Bollobas and Thomason [TO] implies that every 
monotone graph property has a threshold, and much work has been done to determine thresholds for 
various properties. 

Recently, there was much interest in the following natural variant of the classical model. We still 
begin with the empty graph and perform a series of rounds, but at each round, one is now presented 
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with two independent and uniformly random edges, and is asked to choose one of them to add to the 
graph. This is known in the hterature as an Achlioptas process, after Dimitris AchUoptas, who asked 
the question of whether there was an onhne algorithm which could, with high probability, substantially 
delay the appearance of the giant component (a connected component with ^}{n) vertices). 

The trivial algorithm, which arbitrarily chooses the first edge in each offered pair, essentially 
produces the random graph G{n, M) after M rounds, so G{n, M) serves as the benchmark against 
which comparisons are made. A classical result of Erdos and Renyi [llj states that \i M = cn for 
any absolute constant c > 1/2, then the random graph G{n,M) contains a giant component whp. 
For the Achlioptas process, Bohman and Frieze ^ presented an algorithm which could run for 0.535n 
rounds, while keeping the size of the largest component only poly-logarithmic in n whp. Since then, 
much work has been done [11 [3l [5l [6l [TJ [131 [IS]- The current best result for this problem is due to 
Spencer and Wormald [16], who exhibit an algorithm that can run for 0.829n rounds while keeping all 
component sizes bounded by O(logrz) whp. In the opposite direction, Bohman, Frieze, and Wormald 
[5] have shown that no algorithm can succeed whp past 0.964n rounds. Several variants have also been 
studied, such as the offline version, a two-player version, and the question of embracing (accelerating 
the appearance of) the giant component. 

While the main focus of the research mentioned above was the giant component, it is natural 
to study other graph properties in the context of Achlioptas processes. In this paper, we study the 
problem which in the literature is referred to as the small subgraph problem. This was one of the main 
problems studied in the seminal paper of Erdos and Renyi [IT] from 1960, which was the starting point 
of the theory of random graphs. The original problem, stated for the random graph model G{n,M), 
was as follows: given a fixed graph H (a triangle or K4, say), find the smallest value of M such that 
the random graph G(n, M) contains H as a (not necessarily induced) subgraph whp. The subgraph 
is called "small" because its size is fixed while n tends to infinity. 

It turns out that in this problem, the relevant parameter is the maximum edge density m{G) = 
inax{e{H)/v{H) : H is a subgraph of G}. In their original paper, Erdos and Renyi found thresholds 
for all balanced graphs, which are the graphs whose edge density e{H)/v{H) equals the maximum 
edge density m{H). It was not until 20 years later that Bollobas [9] solved the problem for all graphs, 
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proving that for any H with m{H) > 1, the threshold for H appearing in G{n, M) is M* = '"f^) . 
For further reading about the small subgraph problem in G(n, M), we direct the interested reader to 
the monographs by Bollobas [8J and by Janson, Luczak, and Rucinski [TJ], each of which contains an 
entire section discussing this problem. 

In this paper, we consider the small subgraph problem in the context of Achlioptas processes, 
and investigate whether one can substantially affect thresholds by introducing this power of choice. 
Actually, we study a natural generalization of the process, which we call an Achlioptas process with 
parameter r. In this process, r edges of Kn are presented at each round, and one of them is selected. 
We will always consider r to be fixed as n tends to infinity (note that r = 2 corresponds to the original 
Achlioptas process). 

Let us now state our model precisely. At the i-th round, one is presented with r independent 
random edges, each distributed uniformly over all (2) — {i — 1) remaining edges that have not yet been 
chosen for the graph. Note that this eliminates the possibility of choosing the same edge twice, so our 
final graph is simple. However, we do allow the possibility that edges may be offered more than once. 
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which simphfies our arguments. One may consider models in which all sampling is with replacement 
(which may create multigraphs), or in which every edge is offered at most once, but our results in this 
paper will still carry over because we always run the process for o(n^) rounds. 

Note that the graph after the A;-th round of the Achlioptas process with parameter r is a subgraph 
of the random graph with rk edges. So, the question of accelerating the appearance of a fixed graph 
is immediately resolved in the negative. Clearly, the threshold cannot move forward by more than a 
(constant) factor of r. 

So, in this paper we concentrate on the avoidance problem. We may pose it as a single player 
game in which the player loses when he creates a (not necessarily induced) subgraph isomorphic to 
a certain fixed graph H. The player's objective is to postpone losing for as long as possible. We say 
that a function m*{n) is a threshold for avoiding H if: (i) given any function m[n) <^ m*(n), there 
exists an online strategy by which the player survives through m rounds whp, and (ii) given any 
function m(n) ^ m*(n), the player loses by the end of m rounds whp, regardless of the choice of such 
a strategy. 

Note, however, that it is not obvious that thresholds necessarily exist. Furthermore, unlike the 
situation in the small subgraph problem, there are no simple first-moment calculations that suggest 
what the thresholds should be. As it turned out, a substantial part of the difficulty in obtaining our 
results was in conjecturing the correct thresholds. We were able to solve the problem for all cycles Ct, 
cliques Kt, and complete bipartite graphs Kt^t- Let us now state our main result: 

Theorem 1.1. 

(i) For t > 3, the threshold for avoiding Ct in the Achlioptas process with parameter r > 2 is 

r, (t-2)r+2 
n {t-l)r+l _ 

(ii) For t > 4, the threshold for avoiding Kt in the Achlioptas process with parameter r > 2 is n^"^ , 

where 9 is defined as follows: 

r\t-2) + 2 



s= Liog,[(r-i)t + i]j, e 



r-1 

(iii) For t > 3, the threshold for avoiding Kt^t in the Achlioptas process with parameter r > 2 is n?^^ , 
where 9 is defined as follows: 

r%2t - 2) + 2 



s= Llog,[(r-l)t + l]J, 



Remark. In all of these cases, we provide deterministic online algorithms that achieve the thresholds 
whp, but show that even randomized algorithms cannot survive beyond the thresholds. 

The rest of this paper is organized as follows. In the next section, we present some tools from 
extremal combinatorics and the theory of random graphs, which we will use in our proofs. Then, we 
present the proof of our theorem, which is divided into several sections. We begin in Section [3] with 
the case of avoiding when r = 2, which turns out to be the first nontrivial case. We treat this case 
in detail, because our argument there is the prototype for the general argument that we later use to 
prove thresholds for Kt, Kt^t, and Cj. 
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We extend the argument to almost all other Kt and r in Section HI The proof requires many 
inequalities whose somewhat tedious verifications would interfere with the exposition, so we postpone 
their proofs to the appendix. This also makes it easier to distill the abstract argument, which we 
present in Section O Next, we apply the abstraction to prove thresholds for avoiding Ct in Section [U] 
and Kt^t in Section [71 We treat the last remaining case of avoiding K4 in the Achlioptas process with 
parameter 3 in Section [HI The final section contains some concluding remarks and open problems. 

2 Preliminaries 

2.1 Notation and terminology 

Throughout our paper, we will omit floor and ceiling signs whenever they are not essential, to improve 
clarity of presentation. The following (standard) asymptotic notation will be utilized extensively. For 
two functions f{n) and g{n), we write f{n) = o{g{n)) or g[n) = uj{f{n)) if linin-^oo fin) /g{n) = 0, 
and f{n) = 0{g{n)) or g{n) = ^}{f{n)) if there exists a constant M such that |/(^^)| < M\g(n)\ for 
all sufficiently large n. We also write f{n) = Q{g{n)) if both f{n) = 0{g{n)) and /(n) = ^{g{n)) are 
satisfied. 

Let us introduce the following abbreviations for some phrases that we will use many times in 
our proof. As mentioned in the introduction, whp will stand for with high probability, i.e., with 
probability 1 — o(l). It is also convenient for us to introduce the abbreviation wep, which stands 
for with exponential probability, i.e., with probability 1 — o(e~"'^^) for some c > 0. We will say that 
a function / is a positive power of n if f = i}{n^) for some c > 0. Analogously, we will say that a 
function / is a negative power of n if f = 0{n~^) for some c > 0. 

Next, let us discuss the graph-specific terms that we will use. We often need to consider the 
graphs at intermediate stages of the Achlioptas process, so Gi will always denote the graph after the 
i-th round. Our main interest in Gi will be to count copies of subgraphs. Here, we define a copy of 
a graph H in another graph G to be an injective map from V{H) to V{G) that preserves the edges 
of H. Note that copies are not necessarily induced subgraphs, and are labeled, i.e., we do not take 
automorphisms into account when computing the number of copies of -fT in a graph. 

The player's objective in the Achlioptas process is to avoid creating a copy of a certain fixed graph 
H, but our analysis needs to consider subgraphs of H as well. It is therefore convenient to introduce 
the notation H \ke to represent any graph which can be obtained by deleting any k edges from H. 
(When A: = 1, we will simply write H \ e.) This enables us to concisely refer to all graphs of the 
form H \ke in the aggregate. For example, the phrase "the number of copies of H \ ke" should be 
understood to be the total number of copies of all graphs of the form H \ ke. 

We keep track of the numbers of copies of these subgraphs by studying how counts are affected by 
the addition of an edge at a pair of vertices. This motivates the following definition. Let G and H 
be graphs, let k be an integer, and let a,b he a pair of distinct vertices of G. Let G^ be the graph 
obtained from G by adding the edge between a and b if it is not yet present, and let G~ be the graph 
obtained by deleting that edge if it was present. Note that G is equal to either G^ or G^ . Then, we 
say that the pair {a, 6} completes t copies of H\ke if t is the difference between the number of copies 
oi H\ke in G^ and the number in G~ . 

Sometimes, we need to be specific about which graphs of the form H\{k-\- l)e are completed into 
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graphs of the form H \ ke. Let Hi and H2 be graphs on the same vertex set U, with E{Hi) C E{H2), 
but differing only in exactly one edge. Let {u, t'} C C/ be the endpoints of that edge. Let G be another 
graph, and let a, 6 be a pair of distinct vertices of G. Then, we say that the pair {a, h} extends t copies 
of Hi into H2 if t is the number of injective graph homomorphisms (f) : Hi — > G that map {u,v} to 
{a, b}. Note that this definition is insensitive to the presence of an edge between a and b. 

2.2 Extremal combinatorics 

In this section, we present two extremal results, which are used in the proofs of the upper bounds in 
our thresholds (i.e., that no strategy can survive for too many rounds). The following lower bound on 
the number of paths in a graph was obtained in [12J using a matrix inequality of Blackley and Roy. 

Lemma 2.1. Every graph with n vertices and average degree d contains at least {\ + o{l))nd^~'^ copies 
of the t-vertex path Pt- Here, we consider t to he fixed, while d and n tend to infinity. 

Next, we record the following well-known extremal result, which lower bounds the number of copies 
of the complete bipartite graph Kg^t that can appear in any graph with a fixed number of edges. The 
classical proof (via two applications of convexity) is based on the ideas used by Kovari, Sos, and Turan 
|15| to bound the Turan number ex{n, Kg^t)- 

Lemma 2.2. For fixed positive integers s <t, and any function p ^ n~^/*, every graph withn vertices 
and (2)^ edges contains at least (1 + o(l))n*+*p'** copies of the complete bipartite graph Kg^t- 

2.3 Random graphs 

We begin by recalling the Chernoff bound for exponential concentration of a binomial random variable. 
We use the formulation from [1] . 

Theorem 2.3. For any e > 0, there exists Ce > such that the following holds. Let X he any binomial 
random variable, and let fi be its expectation. Then P [\X — ^| > efj] < 2e~'^^^. 

Using the Chernoff bound and a standard coupling argument, we prove a result that allows us to 
relate Gm (the graph after the m-th round of the Achlioptas process) to the more familiar random 
graph G{n,p). 

Lemma 2.4. Suppose that n <^ m <^ n^. Then we may couple the Achlioptas process with G{n,p = 
Arm/n'^) in such a way that w€.p, Gfji is a subgraph ofGi^n^p^. 

Proof, hi the Achlioptas process, r random edges are presented at each round, independently and 
uniformly distributed over all potential edges that have not yet been picked for the graph. So, we 
may couple the first m rounds of the process with the edge-uniform random graph G{n, rm) in such 
a way that if we consider the graph G^ obtained by taking every edge that was offered (instead of 
choosing only one per round), G^ is always a subgraph of G{n,rm). Yet Gm is always a subgraph 
of so it remains to relate G{n,rm) with G{n,p = Arm/n^). This final part is standard and 
proceeds via coupling with the random graph process; under this coupling, G(n, rm) C G{n,p) as long 
as Bin[(2),p] > rm, and the Chernoff bound shows that this event occurs wep. □ 
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Our analysis revolves around counting copies of fixed subgraphs in Gm ■ The previous lemma allows 
us to apply results from the theory of G{n,p) to assist us in this pursuit. We now record several such 
theorems, translated in terms of Gm- The following definition is crucial for counting subgraphs in 
G{n,p). 

Definition 2.5. A graph H is balanced if for any subgraph H' C H, < 

Theorem 2.6. Let H be a fixed balanced graph with v vertices and e edges. Suppose that n ^ m <^ n^, 
and let p = 2m/n^. Also suppose that n'"p^ is a positive power of n. Then the number of copies of H 
in Gm is 0{n'"p^) wep. 

Proof. By Lemma |2.4| it suffices to count copies of H in G{n,2rp). The expected number of copies 
is (1 + o{l))n^ (2rpY = Q{n^p'^), which is a positive power of n by assumption. This allows us to apply 
Corollary 6.3 of [T7], which uses Kim-Vu polynomial concentration to prove the following result: for 
any balanced graph H such that the expected number of copies of H in the random graph is ^ S> log n, 
the probability that the actual number of copies exceeds 2^ is e"^*^'*-' . In our case, is a positive power 
of n, so this implies that wep, the number of copies is 0{n"p'^), as desired. □ 

The previous result provides a very precise count of the number of copies of a fixed graph in the 
random graph G{n,p). However, the point of the Achlioptas process was to deviate from G{n,p) by 
introducing the power of choice. So, our analysis will have to take the potential of choice into account. 
We keep track of the numbers of copies of subgraphs by studying how counts are affected by the 
addition of an edge at a pair of vertices; this motivated the notions of a pair completing t copies of 
H \ke and of the pair extending t copies of Hi into H2, which we defined at the end of Section [2.11 

This is essentially the problem of counting extensions, which has also been well-studied in G{n,p). 
We refer the interested reader to Chapter 10 of [1]. As in the case of counting subgraphs in G{n,p), 
a suitable definition of balanced-ness is required to count extensions. 

Definition 2.7. 

(i) Let Hi and H2 be graphs on the same vertex set U , with E{Hi) C E[H2), but differing only on the 

edge joining the vertices u,v (z U . We say that the pair {Hi,H2) is a balanced extension pair 
if for every proper subset U' C U that still contains {u,v}, the induced subgraph H' = Hi\U'] 
has the property that ^^^^ < ^^H^y2 ■ 

(ii) H\ke has the balanced extension property if every pair {Hi,H2) with V{Hi) = V{H2) = 

V{H), E{Hi) C E{H2) C E{H), e{Hi) = e{H) - k, and 6(^2) = e{H) -k + 1, is a balanced 
extension pair. 

Theorem 2.8. Suppose that n <^ m <^ r? , and let p = 2m/n'^ . Let {Hi,H2) be a balanced extension 
pair, and let v and e be the numbers of vertices and edges in Hi, respectively. Finally, let j be an 
arbitrary integer constant. 

(i) Suppose that n'"~'^p^ is a positive power of n. Then wep, every pair of distinct vertices {a, 6} of 
Gjm extends 0{n'"~'^p^) copies of Hi into H2. 
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(ii) Suppose that n'"^'^p^ is a negative power of n. Then, for any constant 7 > 0, there exists a 
constant C such that with probability 1 — o{n~^), every pair of distinct vertices {a,b} of Gjm 
extends at most C copies of Hi into H2. 

Proof. By Lemma [2.41 it suffices to consider G{n, 2rjp) instead of Gjm in both parts of the theorem. 
For part (i), the expected number of extensions at a pair in G{n,2rjp) is {1 + o{l))n'"-'^{2rjpY = 
Q{n^~'^p'^), which is a positive power of n by assumption. This allows us to apply Corollary 6.7 of |17j . 
which uses Kim-Vu polynomial concentration to prove the following result: for any balanced extension 
pair {Hi, H2) such that the expected number /i of copies of Hi that a fixed edge extends into H2 in the 
random graph is a positive power of n, the probability that the actual number of extensions exceeds 
2^ is e~^^^\ In our case, /i is a positive power of n, so even after taking a union bound over all O(n^) 
pairs of vertices, this implies that wep, every pair of vertices extends 0{n^~^p^) copies of Hi into H2. 
This establishes (i). 

For part (ii), let us bound the probability that {a, b} extends C copies of Hi into H2. Recall that 
Hi and H2 shared the same vertex set U, and differed only on the edge joining u,v G U. Consider 
any graph F which is formed by the superposition of C distinct copies of Hi, all with {u, v} mapping 
to the same pair of vertices {u',v'} E V{F). Let v' = v{F) and e' = e(-F). 

The probability that {a, b} has an extension to F (an injective map from V{F) sending {u' , v'} 1— > 
{a,b}) in G{n,2rjp) is at most n'''-^{2rjpY' = 0((np^'/K-2))^''-2)_ 

easy and standard induction, 

using the fact that {Hi,H2) is a balanced extension pair, implies that > Hence this 

v' —2 

probability is at most 0{{np'^/ '^'"''^^Y ~'^) = 0{{n'"~'^p'^)~^). 

We assumed that n"~'^p^ was a negative power of n. Also, since the C copies of Hi in F are 

1 

distinct, one can trivially bound C < {v' — 2)""^ =^ v' — 2 > C^-^ . So, for a sufficiently large constant 
C, the probability that {a, b} has an extension to F is o{n~^~'^). Taking a union bound over all O(n^) 
pairs of vertices, we see that the probability that there exists any pair of vertices with an extension to 
F is o{n~'^). Since C is a constant, the number of non-isomorphic ways to form F (a superposition of 
G distinct copies of Hi, overlapping on one particular edge) is still a constant. Taking another union 
bound over all such F, we complete the proof. □ 

Corollary 2.9. Suppose that n <^ m <^ , and letp = 2m/n^ . Let H\ke have the balanced extension 
property, and let v and e be the numbers of vertices and edges in H \ ke. Suppose that n^~'^p^ is a 
negative power of n. Let us consider Gjm, where j is an arbitrary integer constant. Then, for any 
constant 7 > 0, there exists a constant G such that with probability 1 — o{n^'^), every pair of distinct 
vertices {a, b} of Gjm completes at most G copies of H \{k — l)e. 

Proof. Fix a pair {a,b}. When counting the number of copies H \ (k — l)e completed by that 
pair, each copy arises from an extension pair {Hi,H2) and an extension of Hi to H2 at the pair. In 
fact, this correspondence is bijective. The balanced extension property guarantees that all such pairs 
are balanced. Since H \s a, fixed graph, only a constant number of non-isomorphic pairs {Hi, H2) can 
arise in this way, so repeated application of Theorem I2.8f ii) completes the proof. □ 
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3 Warm-up 



The purpose of this section is to illustrate on a concrete example the main ideas and techniques that 
we will use in our proofs. We investigate the first nontrivial case, which is the problem of avoiding K/j^ 
in the Achlioptas process with parameter 2. This turns out to be the model for the general case. 

Theorem. The threshold for avoiding K4 in the Achlioptas process with parameter 2 is n'^^^^^ . 

Proof. Lower bound: We need to specify a strategy, and prove that it avoids K4 for many rounds. 
At any intermediate stage in the process, consider a pair of points to be 2-dangerous if the addition 
of an edge between them will create a copy of K4. Otherwise, if the addition of the edge will create 
a copy of K4 \ e, call the pair 1- dangerous. Every other pair is considered to be 0- dangerous (not 
dangerous). The strategy is then to make an arbitrary choice among the incoming edges that are 
minimally dangerous. 

Let m be a function of n that satisfies m <C n^^^^^. It suffices to show that for any such m, 
this strategy succeeds whp. We also may assume without loss of generality that m ^ n'^^^^^ / logn. 
The precise form of the lower bound on m is not essential; it simplifies the argument by disposing of 
uninteresting pathological cases when m is too small. As it is easier to work with G{n,p), we will make 
all of our computations with respect to p, which we define to be 2m/n^. Note that n~^'^/^^/logn <C 
p -C n"^"/^^. The following three claims analyze the performance of our strategy. 

(i) With probability l — o{n~^), Gm has 0{n'^p^) copies of K4\2e and every pair of vertices completes 

0(1) copies of K4 \ e. 

(ii) With probability 1 — o(n~^), Gm has 0{n^p^) copies of K4 \ e. 

(iii) The probability of failure in m rounds is o(l). 

For (i), it is easy to verify that K4 \ 2e is a balanced graph, no matter which two edges are deleted. 
Then the number of copies of K4 \ 2e is roughly what it should be in the random graph G{n,p) — this 
is made precise by Theorem 12. 6| which bounds the number of copies of K4 \ 2e in Gm by O(n^p^) wep 
since n'^p^ is a positive power of n. It is also easy to verify that K4 \ 2e has the balanced extension 
property, so since n'^p^ is a negative power of n. Corollary 12.91 shows that there is some constant G 
such that with probability 1 — o(n~^), every pair of vertices in Gm completes at most G copies of 
K4 \ e. This proves (i). 

For (ii), fix some i < m and consider the (i + l)-st round. In this round, the strategy will create 
one or more copies of K4 \ e only if both incoming edges span pairs that are 1- or 2-dangerous. The 
number of such pairs is at most 0(1) times the number of copies of K4 \ 2e. Since Gi C Gmi claim 
(i) shows that with probability 1 — o(n~^), Gi has 0{n^p'^) copies of K4 \ 2e and every pair of vertices 
completes 0(1) copies of K4 \ e. Call this event Ai, and condition on it. Even after conditioning, the 
incoming edges at the (z-l-l)-st round are still independently and uniformly distributed over the J7(n^) 
unoccupied pairs of Gi, so the probability that we are forced to create a new copy of i^4\e in this round 
is 0((^^^^)^) = 0{n^p^). Furthermore, each time this occurs, we only create 0(1) new copies of i^4\e 
because of our conditioning. Therefore, the number of new copies of K4 \ e in the {i + l)-st round 
is stochastically dominated by 0(1) times the Bernoulli random variable with parameter O(n^p^). 
Letting i run through all m rounds, we see that with probability at least 1 — ^ P ^ 1 ~ o(n^^). 



8 



the number of copies of K4\e in Gm is 0(1) -Bin [m, 0{n'^p^)] . Since m = n?p/2, the expectation of this 
binomial is a positive power of n, so the Chernoff bound implies that wep, it is 0{m-n^p^) = 0{n^p^). 
This proves (ii). 

For (iii), fix some i and consider the probabihty that we lose in the {i + l)-st round. The strategy 
fails precisely when both of the incoming edges span pairs that are 2-dangerous (completing ^^4), and 
the number of such pairs is at most 0(1) times the number of copies of \ e. Since Gi C Gm, claim 
(ii) shows that with probability 1 — o(n~^), Gi has 0{n^p^) copies of K4 \ e. Call this event Bi, and 
condition on it. Even after conditioning, the incoming edges are still independently and uniformly 
distributed over the O(n^) unoccupied pairs of Gi, so the probability that both incoming edges are 
2-dangerous is 0((^^^^f-)^) = 0{n^p^^). Therefore, letting i run through all m = n'^p/2 rounds, a 
union bound shows that the probability that we are forced to complete a copy of by the end of the 
m-th round is P < Oin^p ■ n^p^^) + h-^i] = 0{n^^p^^) + o(l) = o(l). 

Upper bound: Now suppose that m ^ n^^/^^. It suffices to show that we will lose within the first 4m 
rounds whp. Again, we may assume without loss of generality that m <C n^^^^^logn, and we will 
work in terms of G{n,p) with p = 2m/n^. Note that n^^^/^^ ^ p <C n^^^^^^ logn. Let us specify a 
sequence of graphs such that each graph is obtained from the previous one by adding a single edge: 
let Hq = (4- vertex path). Hi = O4 (4-cycle), H2 = \ e, and = K^. It is easy to verify that 
the corresponding pairs {Hq,Hi), {Hi,H2), and [H2,H^) are all balanced extension pairs. Our result 
follows from the following four claims: 

(i) Gm always contains Q,{n'^p^) copies of Hq. Also, wep, every pair of vertices in G2m extends 

0{n?'p^) copies of Hq into Hi. 

(ii) G2m contains U,{n^p'^) copies of Hi whp, and with probability 1 — o(n~^), every pair of vertices 

in G-im extends 0(1) copies of Hi into H2. 

(iii) contains Q{n^p^) copies of H2 whp, and with probability 1 — o(n~^), every pair of vertices 
in G/im extends 0(1) copies of H2 into i/3. 

(iv) The probability of survival through 4m rounds is o(l). 

Proof of (i). Since the average degree in Gm is precisely 2m/n = np ^ 1, from Lemma 12.11 we 
conclude that the number of 4- vertex paths is ^{n{np)^). The second part of this claim follows from 
Theorem 12. Sf i) since {Hq, Hi) is a balanced extension pair and n'^p^ is a positive power of n. □ 

Proof of (ii). The second part of (ii) follows from Theorem I2.8l fii) since {Hi,H2) is balanced and 
nPp^ is a negative power of n. To prove the first part of (ii), consider the {i + l)-st round, where 
m < i < 2m. Regardless of the choice of strategy, if both incoming edges span pairs that extend 
Q{'n?p'^) copies of Hq into Hi, we will be forced to create Q.{n?'p^) new copies of Hi. 

By (i), the total number of copies of Hq in Gi D Gm is il(n^p'^). For a pair of vertices {a,h}, let 
Uafi be the number of copies of Hq that {a, 6} extends to Hi. Recall that this definition does not 
depend on the presence of an edge between a and h. Since Gi C G2m-, claim (i) shows that wep, in Gi 
every Uafi = 0{n?'p^). Call this event Ai, and condition on it. 

Let us estimate the average value of Ua^b over all pairs. Since Hq differs from Hi at exactly one 
edge, each copy of Hq has a pair at which it contributes +1 to the sum Therefore, averaging 
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over all (2) pairs of vertices, we obtain that the average number of copies of Hq that are extended to 
Hi at a pair is Q{v?p^). On the other hand, every pair of vertices in Gi extends 0{v?p^) copies of Hq 
into Hi. Therefore, at least a constant fraction 7 (where 7 = can be chosen to be the same for 
all i) of all (2) pairs have the property of extending r2(n^p^) copies of Hq into Hi. Let P be the set 
of all such pairs. Regardless of the choice of strategy, if both incoming edges span pairs in P, we will 
be forced to create Q{n'^p^) copies of Hi. Since i = o(n^) = o(|P|) and incoming edges are uniformly 
distributed over the (2) ~ ^ = (1 ~ ^i^))(2) unoccupied pairs, we conclude that the probability that 
both incoming edges span pairs in P is g > (1 + o(l))7^ = 

Let i run from m to 2m. Then, up to an error probability of at most P [~'^«] = the number 
of copies of Hi in G2m is at least Bm{m,q) ■ $7(n^p^). By the Chernoff bound, the binomial factor 
exceeds mq/2 = J7(n^p) wep; thus, whp G2m has ri(n^p • n'^p^) = r2(n^p^) copies of Hi. □ 

Proof of (iii). The second part of (iii) follows from Theorem 12.8( 11) since {H2,H^) is balanced and 
n^p^ is a negative power of n. For the first part of (iii), let us consider the {i + l)-st round, with 
2m < i < 3m. Regardless of the choice of strategy, if both incoming edges span pairs that extend 
copies of Hi into H2, we will create a copy of H2. Let P be the set of all such pairs. We need a lower 
bound on |P|. Condition on the event B that G2m contains O(n^p^) copies of Hi, which occurs whp 
by (ii). Also by (ii), with probability 1 — o(n~^), every pair of vertices in Gi only extends 0(1) copies 
of Hi into H2, since Gi C G^m- Call this event Gi, and condition on it. 

Note that every copy of Hi contributes a pair to P which extends Hi into H2, namely the pair at 
which it is missing an edge compared to H2. On the other hand, every such pair was only counted 
0(1) times, since every pair in Gi extends 0(1) copies of Hi into H2. This implies that \P\ = Q{n^p'^). 
The incoming edges are uniformly distributed over all unoccupied pairs. If at least half of the pairs 
in P were occupied, then we would have Q{n^p^) ^ n^p^ copies of H2, which would already give the 
conclusion of (iii). Otherwise, the probability that both incoming edges span pairs in P (hence forcing 
the creation of a new copy of H2) is g > (1 + o(l)) (^^r/^)^ = ri((^^^)^) = r2(n^p^). 

Letting i run from 2m to 3m, we see that with error probability at most P [~^B] + ^ P ["'Oj] = o(l), 
either we already obtained the conclusion of (iii), or the total number of copies of H2 is at least 
Bin(m, g). The expectation of this binomial is (n^p/2)g = Q.{n^p^), which is a positive power of n. 
Hence, by the Chernoff bound, G^m has Q.{n^p^) copies of H2 whp. □ 

Proof of (iv). Consider the (i + l)-st round, where 3m < i < 4m. Regardless of the choice of strategy, 
if both incoming edges span pairs that complete copies of H3 = K4, we lose. We can lower bound 
the number of such pairs by Q{n^p^) by conditioning on the following events. Let D be the event 
that contains Q{n^p^) copies of H2, which occurs whp by (iii). Also by (iii), with probability 
1 — o(n~^), every pair of vertices in Gi extends 0(1) copies of H2 into H^; call this event Ei. 

Even after conditioning, incoming edges in the (i + l)-st round are independently and uniformly 
distributed over the (2) ~ = 0(n^) unoccupied pairs of Gi. Therefore, the probability that both 
pairs complete K4, conditioned on survival through the i-th round, is pi = r2((^^^^^)^) = J7(n^p^®). 
Letting i run from 3m to 4m, we see that the probability that any strategy can survive for 4m rounds 
is at most 

P < F [^D] + [^Ei] + 11(1 - p,) < 0(1)+ exp[-^p,^ 
< 0(1) + exp { - n{n'^p ■ nV^)} = o(l) + e"*^^^^ = o{l), 
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which completes the proof. □ 

4 Avoiding Kt, general case 

The previous section proved the threshold for avoiding Kt in the Achlioptas process with parameter 
r, when t = 4 and r = 2. The case t = 3 will be covered in Section [H which considers all cycles Ct- In 
this section, we resolve all other cases, except for the special case {t, r) = (4, 3) which requires more 
delicate analysis. We postpone this final case to Section [8l 

Theorem. For either t > 5 and r > 2, or t = 4 and r > 4, the threshold for avoiding Kt in the 
Achlioptas process with parameter r > 2 is n^~^, where 9 is defined as follows: 

r^(f _ o^i -I- 2 
s=[\ogA{r-l)t + l]l ^ 



Before we begin the proof, let us prove an inequality that we will use in two claims in the lower 
bound, and the last claim of the upper bound. 

Inequality 4.1. Let a > 2, b > 0, and r > 1, and let s be a positive integer. Define the sequences 
{xs,Xs-i, . . . , xq} and {ys,ys-i, • • • , Ho} as follows. Set Xg = a and ys = b, and define the rest of the 
terms recursively by 

Xk-i = 2 + {xk- 2)r, yk-i = 1 + ytr. 

Then for any p ^ n"^'"/^", n^'^p^'' is a positive power of n for every A; G {s, . . . , 1}. 

Proof. Fix any k {s, . . . ,1}. One can easily solve the recursions for Xk and yk to find: 



Xfc = r^~^(a - 2) + 2, = + ' ^ 



r - 1 

Therefore, 

nS — k 



Xk _ r'-^ia - 2) + 2 _ r%a - 2) + 2r'' 



Vk r'-^b + ^^^^ r^b + 



r— 1 r— 1 

By the original definition via the recursions, x^ and yk are both positive, so the numerator and 
denominator of the final fraction above are positive. Yet as k decreases, the numerator decreases 
and the denominator increases. Therefore, Xk/yk > xo/yo. In particular, since we assumed that 
p ^ we conclude that n^kpHk jg ^ positive power of n, as desired. □ 

Note that if we choose a = v{Kt) = t and b = e{Kt) — s = {2) ~ s, then the above recursions 
produce xq and yo such that the fraction xo/yo is equal to our 6. Let us now return to the proof of 
our thresholds for avoiding Kt. 

Proof of Theorem. Lower bound: The strategy is a natural extension of the one used to avoid 
K4. At any intermediate stage in the process, for any 1 < d < s, consider a pair of points to be 
d- dangerous if d is the maximal integer such that the addition of an edge between them will create a 
copy of Kt\{s — d)e. If there is no such d, consider the pair to be 0-dangerous. The strategy is then 
to make an arbitrary choice among the incoming edges that are minimally dangerous. 
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Let m <C n^~^, and let p = 2m /v?. Again, we assume without loss of generality that m ^ 
n-^~^/logn. Note that n~^/Iogn ^ p ^ n^^ . We will analyze the performance of our strategy by 
proving three successive claims: 

(i) With probability 1 - 0(11^^^), Gm has 0( 

n^piz) copies of Kt \ se, and every pair of vertices 

completes 0(1) copies of Kt\{s — l)e. 

(ii) For each A; € {s, s — 1, . . . , 2}, and constants x and y such that (rPp) ( "^f ) ^ is a positive power 

of n, statement (a) implies statement (b), which are defined as follows: 

(a) With probability 1 — o(n~^^), Gm has 0{n^py) copies of Kt \ ke, and every pair of vertices 
completes 0(1) copies of Kt\{k — l)e. 

(b) With probability 1 — o(n~^('^~^)), Gm has 0((n^p) (^^^)'') copies Kt\{k — l)e, and every 
pair of vertices completes 0(1) copies of Kt\{k — 2)e. 

(iii) The probability of failure in m rounds is o(l). 

Again, we separate the proofs of the claims for clarity. At several points, we require certain inequalities 
whose rather tedious proofs would interfere with the exposition. The appendix contains the precise 
formulations and proofs of these statements. 

Proof of (i). Lemma lA.31 verifies that Kt \ se is a balanced graph, and the k = s case of Inequality 
14.11 shows that n^pi^)"^ is a positive power of n, so Theorem 12.61 implies that the number of copies of 
Kt \ se in Gm is 0[n^p^'^^ '^) wep. For the second part of claim (i), Lemma lA.41 verifies that Kt \ se 
has the balanced extension property, and Inequality IA.8I shows that n^^^p^z)"** is a negative power of 
n. So, Corollarv 12.91 shows that there is some constant C such that with probability 1 — o{n~'^^), every 
pair of vertices in Gm completes at most G copies of Kt\{s — l)e. This finishes claim (i). □ 

Proof of (ii). Fix k, x, and y as specified, and let us show that (a) implies (b). First, since every 
graph of the form Kt\{k — 2)e always contains some graph of the form Kt\{k — l)e, (a) immediately 
implies that with probability 1 — o{n~'^^), every pair of vertices completes 0(1) copies of Kt\{k — 2)e; 
this implies the second part of (b). 

It remains to show the first part of (b). Fix some i < m and consider the (i + l)-st round. In this 
round, the strategy will create one or more copies of Kt\{k — l)e only if all r incoming edges span 
pairs that are at least [s — k -\- l)-dangerous (i.e., create copies of Kt\{k — l)e). The number of such 
pairs is at most 0(1) times the number of copies Kt \ ke. Since Gi C Gm-, statement (a) implies that 
with probability 1 — o(n~-^'^), Gi has 0{n^py) copies of Kt \ ke and every pair of vertices completes 
0(1) copies oi Kt\{k — l)e. Call this event Ai, and condition on it. Even after conditioning, incoming 
edges are still independently and uniformly distributed over the Q.{'n?) unoccupied pairs of Gi, so the 
probability that some new copies of Kt\{k — l)e are created in this round is 0(( "^f" )^) . Also, by our 
conditioning, the number of newly created copies of Kt\{k — l)e is still 0(1) even when this occurs. 
Therefore, the number of new copies of Kt\{k — l)e in the {i + l)-st round is stochastically dominated 
by 0(1) times the Bernoulli random variable with parameter O ( ( Y)- Letting i run through all m 
rounds, we see that with probability at least 1 — ^ P > 1 — o{n~'^^^~^^), the number of copies of 
Kt\{k-l)e inGm is 0(1) •Bin[m, 0((^)'')] . Since this binomial has expectation ^ •0((^)''), 



12 



which is a positive power of n by the assumption on x and y, a Chernoff bound imphes that it is 
0{(.n^p){^Y) wep. This finishes (ii). □ 

Proof of (iii). The idea is to apply claim (i), and then to repeatedly apply claim (ii) until we obtain 
a high-probability upper bound on the number of copies of Kt \ e. Then, we complete the proof with 
essentially the same argument as in claim (iii) of the proof of the lower bound for avoiding K4. 

To keep track of the exponents of n and p in the successive upper bounds, define the sequences 
{xs,Xs-i, . . . , xq} and {y^, Us-i, ■ ■ ■ , Uo} as in Inequalitv l4.lt which then verifies that n^'^py^ is a positive 
power of n for every A;G{s — 1,...,!}. Hence we can apply claims (i) and (ii) until we conclude that 
with probability 1 — o(n~^), Gm has 0{nF^py^) copies of Kt \ e. 

Now fix some i and consider the probability that we lose in the (i + l)-st round. The strategy 
fails precisely when all r of the incoming edges span pairs that are s-dangerous (completing Kt), 
and the number of such pairs is at most 0(1) times the number of copies of Kt \ e. Yet since 
Gi C Gm, the previous paragraph shows that with probability 1 — o(n~^), Gi has 0{n^'^p^^) copies 
of Kt \ e. Call this event Bi, and condition on it. Even after conditioning, incoming edges are still 
independently and uniformly distributed over the 0,{n'^) unoccupied pairs of Gi, so the probability 
that all incoming edges complete Kt is 0(( " iJf^ Y) ■ Therefore, letting i run through all m = n'^p/2 
rounds, a union bound shows that the probability that we are forced to complete a copy of Kt is 
P < 0{{n'^p){^^-^Y) +Y.^hBi] = 0(72^0^2^0) + 0(1). This in turn is o(l) because we assumed that 
p ^ n^^ with 9 = xq/uq. This completes the proof. □ 

Upper bound: Let m ^ v?^^ , and let p = 2m We will show that whp, any strategy fails within 
G(m) rounds, which we again break into periods of length m. We may assume that m <^ ?i^~^logn 
without loss of generality. Note that n^^ <^p ^ n^^ log n. 

As in the proof of the upper bound for avoiding Kj^, we will specify a sequence of graphs such that 
each graph is obtained from the previous one by adding a single edge. Let Hi = -ft^^ij |- 1-| (the largest 
bipartite subgraph of Kt), and arbitrarily choose the rest of the sequence {H2, H3, . . . , Hf}, where 
Hf = Kt, by adding one missing edge at a time. So, / = 1 + (2) ~ L|J Til > which is a constant because 
we assumed t to be fixed. Our result follows from the following five claims: 

(i) Gm contains i}{n''p^^^^^) copies of Hi whp. 

(ii) Let A; be a positive integer for which n''~'^p^(^k-i) ^ positive power of n. Then G^m contains 

ri(n*p'^(^''')) copies of H^. whp. 

(iii) G(^f_s-^m contains ^{n^p^^^i-"^) copies of Hf^s whp. Also, n^^^p^^^^^"-* is a negative power of 
n; hence with probability 1 — o(n~^), every pair of vertices in G(^f^s+i)m extends 0(1) copies of 
Hf_s into Hf_s+i- 

(iv) For each /c G {s, s — 1, . . . , 2}, and constants x and y such that rv^p^ <C and {n^p) ( ) ^ is a 
positive power of n, statement (a) implies statement (b), which are defined as follows: 

(a) G(^f_k-^m contains Q.{n^py) copies of Hf^k whp, and with probability 1 — o(n^^), every pair 
of vertices in G(^f_k+i)m extends 0(1) copies of into Hf^i^^i. 

(b) contains ^l[{'n?p) (^^;^)'') copies of Hj^k+i whp, and with probability 1— o(n^^), 
every pair of vertices in G(j„fc_|_2)m extends 0(1) copies of ^^/-a,.+i into -fr/-fc+2- 
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(v) The probabihty of survival through fm = 0(m) rounds is o(l). 

Proof of (i). We will actually prove that Gm contains ^.{n^p^^^^^) copies of Hi with certainty, not 
just whp. However, the rest of the claims only require a whp result in claim (i), so we keep it there 
for the purpose of generality. 

Since we assumed that p ^ n^^ and Inequality IA.6I bounds —9 > — [fj ^, Lemma 12.21 implies 
that the number of copies of the complete bipartite graph Hi = K^i^ |-t-| in any m-edge graph is 
J7(ny(-^i)). □ 

Proof of (ii). We proceed inductively. The base case of the induction follows from claim (i). Now, 
suppose k satisfies the property that n*~y(^'=-i) is a positive power of n, and contains 
j^^^tpe(Hfc_i)-j pQpigg Qf Hfi^i whp. We will show that Gkm contains ^{n^p^^^^^) copies of H]^ whp. 

Let us begin by conditioning on the high-probability event A from our inductive assumption: that 
G[k-i)m contains ^}{n^p'^^^''-'^^) copies of H^-i- Now consider the (i -|- l)-st round, where {k — \)m < 
i < km. Since Gi D the total number of copies of Hk-i in Gi is Q.{'n!'p^^^^-'^'>) by our 

conditioning. 

Lemma [A . 5 1 verifies that (H^-i, H^) is a balanced extension pair, and we assumed that u^-'^p^i^k-i) 
was a positive power of n, so Theorem 12. 8( i) establishes that wep, every pair of vertices in Gkm extends 
Q^^t-2pe(Hk^i)^ copies of Hk_i into H^. Since Gi C Gkm, the same bound holds for Gi wep; call that 
event Bi, and condition on it. 

For a pair of vertices {a, b}, let 71^ 6 be the number of copies of -fffe-i that the pair {a, b} extends 
into fTfc. Recall that this definition does not depend on the presence of an edge between a and b. Let 
us estimate the average value of Uafi over all pairs. Since -fffc-i differs from Hf^ at exactly one edge, 
each copy of has a pair at which it contributes -|-1 to the sum ^ ra^ fe. Therefore, averaging over 
all (2) pairs of vertices, we obtain that the average number of copies of H^-i that are extended to H^ 
at a pair is ^(n*"^^^*-!)). 

On the other hand, every pair of vertices in Gi extends 0(n*~^p'^(^'=-i)) copies of H^^i into H^.. 
Therefore, at least a constant fraction 7 = ^2(1) of all (2) pairs have the property of extending 
0(n*~^p^(^'=~i'') copies of Hk-i into Hk- Let P be the set of all such pairs. Regardless of the choice 
of strategy, if all r incoming edges span pairs in P, we will be forced to create i}{n^~'^p^^^''-^^) copies 
of Hf^. Since i = o{v?) = o(|P|) and incoming edges are uniformly distributed over the (2) ~ ^ = 
(1 -|-o(l))(2) unoccupied pairs, we conclude that the probability that all incoming edges span pairs in 
P\sq> (l + o(l))7^ = 

Let i run from [k — l)m to km. Then, up to an error probability of at most P [-^A\ -|- [-^Bi\ = 
0(1), the number of copies of Hk in Gkm is at least Bin(m,g) • Q.{'n}~'^p^'^^^-'^'>). By the Chernoff 
bound, the binomial factor exceeds mq/2 = Vi{p?p) wep; thus, whp Gkm has Q,{n'^p ■ n^~'^p^(^k-i)^ = 
^[n^p^i^k)^ copies of Hk. □ 

Proof of (iii). The first part follows directly from claim (ii), because Inequality I A. 71 verifies that 
^t-2pe(H(^_5)_i) -g ^ positive power of n. For the second part, {Hfs, Hfs^i) is a balanced extension 
pair by Lemma \A.5\ and n^~'^p^(^f-'>^ is a negative power of n by Inequalitv lA.Si Therefore, Theorem 
I2.8l fii) shows that there is some constant C such that with probability 1 — o{n~'^), every pair of vertices 
in G'(j_s_|_i)m extends at most C copies of Hf_g into Hf^^+i- This finishes claim (iii). □ 

Proof of (iv). Fix k, x, and y as specified in the statement, and assume statement (a). Let us 
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begin by establishing the second part of (b). Lemma fA . 5 1 verifies that {Hf^k+i, Hf-k+2) is a balanced 
extension pair, and Inequality IA.8I shows that v}~'^p^^^f-k+i) is a negative power of n for k < s. 
Therefore, Theorem 12. 8( ii) shows that there is some constant C such that with probability 1 — o(n~^), 
every pair of vertices in G(^j_j^_^2)m extends at most C copies of Hj_k_^_i into Hj_k^2- This finishes 
the second part of (b). 

It remains to prove the first part of (b). Consider the {i + l)-st round, with (/ — k)m < i < 
{f — k+ \)m. Regardless of the choice of strategy, if all r incoming edges span pairs that extend copies 
of Hf_k into we will create a copy of Hf_k^i. Let P be the set of all such pairs. We need a 

lower bound on \P\. 

Condition on the high-probability event C of (a) that contains n{n^py) copies of Hf^k- 

Since Gi C G(^f_k+i)m^ (a) implies that with probability 1 — o(n~^), every pair of vertices in Gi extends 
0(1) copies of Hf^k into Hf^^+i- Call this event Di, and condition on it. 

Note that every copy of Hf_k contributes a pair to P which extends Hf-k into Hf_k+i, namely 
the pair at which it is missing an edge compared to Hf-k+i- On the other hand, every such pair 
was counted at most a constant number of times, since every pair in Gi extends 0(1) copies of Hf-k 
into Hf-k+i- This implies that |P| = Vt{n^py). The incoming edges are uniformly distributed over all 
unoccupied pairs. If at least half of the pairs in P were occupied, then we would have Q.{n^py) copies 
of Hf^k+i- Yet this would already give us the conclusion of (b) since: 

„V»(„^p)(!5^)»(nV)(^)^ 

(The first inequality is because p <ti 1, and the second inequality follows from the assumption that 
fi^pV <^ Otherwise, if less than half of the pairs in P are occupied, then the probability that 

all incoming edges span pairs in P (hence forcing the creation of a copy of Hf^k+i) is Q > (1 + 

Letting i run from (/ — k)m to {f — k + l)m, we see that with error probability at most P [-■C] + 
[-^Di] = 0(1), either we already obtained the conclusion of (b), or the total number of copies of 
Hf-k+i is at least Bin(m,g). The expectation of the binomial is {J^)q = ^[{n'^p){^^^:^y) , which is 
a positive power of n by assumption. Hence, by the Chernoff bound, G(^f_k+i)m has ^l[{n'^p)[^^^^y) 
copies of Hf_k+i whp. □ 

Proof of (v). The result of claim (iii) plugs in directly to claim (iv), which we may iterate until it 
gives us a a lower bound on the number of copies of -f^/-i = Kt\e and an upper bound on the number 
of copies of Hf_i that any pair extends into Hf = Kf. 

To keep track of exponents in the successive lower bounds, define the sequences {xs,Xs-i, ■ ■ ■ , xq} 
and {ys,ys-i, ■ ■ ■ , Ho} exactly as in Inequality 14. li To verify that we can indeed iterate claim (iv), we 
must show that for all /c G {s, s — 1, . . . , 2}, we have that n^'^py^ <c n^, and n^k^ipyk-i ^ positive 
power of n. The first statement follows from an easy induction: claim (iii) establishes it for k = s, and 
if n^t^pyk <^ 77,2^ then " ^ 1, so combined with p ^ 1, we see that rfk^ipVk-i = (n^p)( " ^ 
n?. The second statement is verified by Inequality 14. 11 Therefore, we arrive at the result that 
contains r2(n^^p^^) copies of = Kt\e whp. Call this event E, and condition on it. We also 

find that with probability 1 — o(n^^), every pair of vertices in Gfm extends 0(1) copies of -f^/-i into 
Hf (i.e., completes 0(1) copies of Kt). The same probability bound also holds in Gi for any i < fm, 
because Gi C Gfm] let Fi be the corresponding event. 
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Now consider the (i + l)-st round, where (/ — l)m < i < fm. Regardless of the choice of strategy, 
if ah r incoming edges span pairs that complete copies of Kt, we will lose. We can bound the number 
of such pairs by ri(n^ip^i) by conditioning on the above events E and Fj. Even after conditioning, 
the incoming edges in this round still independent and uniformly distributed over the (2) ~ ^ = 0(n^) 
unoccupied pairs of Gj. Therefore, the probability that all r pairs complete Kf, conditioned on survival 
through the i-th round, is pi = ^(( " lS"^ y)- Letting i run from (/ — l)m to fm, we see that the 
probability that any strategy can survive for fm rounds is at most 

P < F[-^E] + J2^[-^Fi\ + l[{l-pi)<o{l) + e^p[-^p^} 

< 0(1) + exp { - ((n^p) (J^-^) ') } = 0(1) + exp{-J^(n>^°)}. 
This in turn is o(l) because we assumed that p ^ with 6 = xo/yo- This completes the proof. □ 

5 Abstraction into general argument 

Note that we structured our exposition of the previous section in the following manner. The arguments 
did not directly use properties of the specific graph that we were avoiding {Kt). Rather, they were 
linked to lemmas and inequalities that proved certain properties (e.g., balanced-ness, etc.) about Kf. 
Let us now isolate these necessary "ingredients" that one can plug in to our general machinery to 

prove thresholds. 

For the rest of this section, let H be the fixed graph which we wish to avoid. Our arguments 
allow one to prove the threshold for avoiding H in the Achlioptas process with parameter r simply 
by specifying several parameters, and then proving some lemmas and inequalities that do not need to 
refer to the Achlioptas process at all. We first describe the parameters. 

• s: this was the number of levels of danger considered by the avoidance strategy in the proof of 
the lower bound. At any intermediate stage in the process, for any 1 < d < s, we considered a 
pair of points to be d-dangerous if d was the maximal integer such that the addition of an edge 
between them created a copy oi H\{s — d)e. If there was no such d, we considered the pair to be 
0- dangerous. Recall that the strategy was then to make an arbitrary choice among the incoming 
edges that were minimally dangerous. 

• A sequence of graphs {Hi, . . . ,Hf} sharing the same vertex set, with each successive graph 
containing exactly one more edge: this was used in the upper bound argument to iteratively 
prove lower bounds on the number of copies of Hi, proceeding from i = 1 to i = f. 

The correct choice of s then determined 9, the negative exponent in the threshold (in terms of p) for 
avoidance: 

^_ r%v{H)-2) + 2 
rs ^e{H) - + ^ ■ 

Assuming that the parameters were suitably chosen, one then only needed to establish the following 
lemmas and inequalities in order to prove that the threshold for avoiding H in the Achlioptas process 
with parameter r is n^~^ . 

For proof of lower bound. Here, / log n <^p <^ . 
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1. H\se is a balanced graph. This allowed us to prove in claim (i) that wep, Gm has 0{n"^^^p^^^^ ^) 
copies of H \ se. For H = Kt, this was provided by Lemma |A.3[ 

2. H \se has the balanced extension property, and n'"^^^~'^p'^^^^~^ is a negative power of n. This 
allowed us to prove in claim (i) that with probability 1 — o(n~^'^), every pair of vertices in Gm 
completes 0(1) copies of H \ (s — l)e. For H = Kt, these were provided by Lemma lA.41 and 
Inequality IA.8I 

For proof of upper bound. Here, <C p ^ logn. 

1. Gm contains n{n"^^^'> p^^^^^) copies of Hi whp. This was claim (i), and for H = Kf, it was 
provided by the extremal estimate on the number of Kg^t (Lemma I2.2p . along with Inequality 
\A.6\ which assured that p was large enough to apply the extremal result. 

2. Each consecutive pair {Hi^., H^.^i) is a balanced extension pair. This was used throughout the 
proof of the upper bound, and for H = Kt, it was provided by Lemma lA.51 

3. fi'"^^^~'^p^^^^~'^^^ is a positive power of n. This was used in claim (iii) to show that we 
could iterate the argument of claim (ii) enough times to conclude that G{^f_g-^m contained 

copies of Hf^s whp. For H = Kt, this was provided by Inequality I A. 71 

4. n'"^^^~'^p^^^^~^ is a negative power of n. This was used in claim (iii) to transition to the next 
inductive process, which relied on the copies of not being too concentrated on any pair of 
vertices. Note: this statement was already required above for the lower bound, so we do not 
need to check it again. 

6 Avoiding cycles 

Now we show by example how to use our machinery to prove avoidance thresholds. We start with an 
easy application which completely solves the problem for cycles Ct. In light of the previous section, we 
only need to provide the required parameters, lemmas, and inequalities. We will specify these in the 
same order that they were presented in the previous section. This will prove the following theorem. 

Theorem. For t > 3, the threshold for avoiding Ct in the Achlioptas process with parameter r > 2 is 

r, r(t-2) + 2 
fl r(t-l) + l _ 

Proof. We use the parameter s = 1, and the sequence of graphs Hi = Ct\e, H2 = Ct- This gives the 
threshold n^~^, where 6 = ^, ^'"^'^*^~'^^ts^ 1 = 47-TrrT, which matches the claimed result. Now we need 

' r^ie{Ct)-s)+^^ r(t-l)+l' 

to provide the required lemmas and inequalities. For the reader's convenience, we have reproduced 
the italicized statements from Section [5j 

For proof of lower bound. Here, n^^ / log n -^p <^ n^^ . 

1. Ct\e is a balanced graph. This is obvious. 

2. Ct\e has the balanced extension property, and ji"{Ct)-2pe{Ct)-i _ ^t-2pt-i ^ negative power 

r(t~2) + 2 

of n. The first part is obvious. For the second, since p <^ n , we must establish that 
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{t — 2) — {t — 1) < 0. Routine algebra shows that the left hand side equals — , 

which is certainly negative when t > 3, r > 2. 

For proof of upper bound. Here, n^^ ^ p ^ n^^ log n. 

1. Gm contains Q{n'"^^''-^p'^^^^^) copies of Hi whp. The average degree of Gm is precisely np by the 
definition of p = 2m /n^. We show in item ^3 below that np is a positive power of n, so it tends 
to infinity with n. Thus, we may apply Lemma |2.H an extremal result counting the number of 
paths, to conclude that Gm contains at least (1 + o(l))n(np)*~^ copies of the t-vertex path Hi, 
as desired. 

2. {Hi,H2) is a balanced extension pair. This is easy to see. 

3. j7,^{C*)-2pe(Ct)-i-i _ (jipY^'i ig Q positive power of n. It suffices to show that np is a positive 

r(i~2)+2 rft-2")+2 

power of n. Since p^ n r{t-i)+i ^ i^j^jg amounts to proving that 1 — > 0. Routine algebra 

shows that the left hand side equals ^f^^^Si^^i > which is certainly positive when t > 3, r > 2. 

As we have provided all of the necessary ingredients to apply our machinery, we are done. □ 



7 Avoiding Kt^t 

Now we show a more complex application of our machinery, which completely solves the problem for 
Kf^f This will prove the following theorem. 

Theorem. Suppose that t > 3 and r > 2 are fixed integers. The threshold for avoiding Kt^t in the 
Achlioptas process with parameter r is n^^^ , where 9 is defined as follows: 

«=Liog.[(r-i)t+i]j, ^= ^.(,L,);ri^ - 

7.1 Parameters 

The value of s is already specified in the statement of the theorem, so we proceed to give the sequence 
of graphs {Hi, . . . , Hf}. The sequences are quite different depending on the parity of t, so we describe 
them separately. 

Case 1: t is even. Let Hi be the 4-partite graph with parts Vi, V2, V3, V4, each of size t/2, and edges 
such that {Vi, V2), {Vi, V4), and (V3, V2) are complete bipartite graphs. Let {H2, . . . , i/i+(t/2)} be 
obtained by successively adding single edges until -ffi+(t/2) has a perfect matching between V3 and 
V4. Then, arbitrarily choose the rest of the sequence {-ff2+{i/2) j • • • > Hj} by adding one edge at a 
time, until the final term is the complete bipartite graph Kt^t with bipartition (Vi U V3, V2 U V4). 
Note that f = l + t'^/i. 

Case 2: t is odd. Let Hi be a 6-partite graph with parts {V^}^ such that V3 and V4 are singletons, 
and the other four parts each have size [t/2j. The edges are as follows: the two pairs {Vi,V2) 
and (V5, Vq) are each complete bipartite graphs, the vertex in V3 is adjacent to all of V2U V4U Vg) 
and the vertex in V4 is adjacent to all of Vi U V3 U V5. There are no more edges. 
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Let {if2, • • • , -ffi+[t/2j } be obtained by successively adding single edges until -ffi+[4/2j has a 
perfect matching between Vi and Vq. To create the next [t/2\ graphs in the sequence, we put 
down a matching between V5 and V2, one edge at a time. Finally, arbitrarily choose the rest of the 
sequence {-ff2+2[t/2j i • • • > Hj} by adding one edge at a time, until the final term is the complete 
bipartite graph Kt^t with bipartition {Vi U V3 U 14, V2 U V4 U Ve)- Note that / = 1 + 2[t/2j2. 

7.2 Lemmas and inequalities 

Next, we provide the required lemmas and inequalities. For the reader's convenience, we have repro- 
duced the italicized statements from Section [5l 

For proof of lower bound. Here, n^^ / log n <^ . 

1. Kt^t \se is a balanced graph. This is now provided by Lemma iB.ll Actually, the graph is not 
balanced when t = 3 and r = 2, but in that particular case. Lemma [BT] additionally proves that 
the number of copies of Kt^t \ se in Gm is still 0{n<^^p^^^')-') wep, which is all we really need. 

2. Kt^t \ se has the balanced extension property, and n^(^*.*)~2pe(_ft:t,t)-s ^ negative power of n. 
These are now provided by Lemma IB. 21 and Inequality IB. 81 

For proof of upper bound. Here, n^^ <C p ^ n^^ log n. 

1. Gm contains Q{n'"^^^^p'^^^^^) copies of Hi whp. This time, we use Inequalitv IB. 61 to show that 
—9 > —2/t. Since we assume that p ^ for the upper bound argument, this provides the 
condition required to apply either Lemma [7. II if t is even, or Lemma [7. 2 1 if t is odd. Both lemmas 
(presented below) lead to the required final statement. 

2. Each consecutive pair {Hk, H^^i) is a balanced extension pair. This is now provided by Lemma 
IB. 41 if t is even, and by Lemma IB. 51 if t is odd. 

3. is a positive power of n. This is provided by Inequalitv IB. 71 

7.3 Proofs of supporting lemmas 

We conclude this section by proving the two lemmas that provide the first component of the proof of 
the upper bound. We start with the lemma that is used when t is even. 

Lemma 7.1. For any fixed positive integers k and I, consider the following 4-pO''r'tite graph, which we 
call H. Let the parts be Vi,V2,Vs,V4, with \Vi\ = \V2\ = k and \Vs\ = IV4I = /, and place edges such 
that {Vi,V2), (yi,Vi), and (V2,V3) are complete bipartite graphs. There are no more edges. Then, 
there exists a constant Ck such that for any p ^ n~^/^ , every graph with n vertices and (^)p edges 
contains at least {ck + o(l))n^'^+^y^"'"^'^' copies of H . 

Proof. Let us fix an ambient graph G with n vertices and {^p edges. By Lemma |2.2| the number 
of copies of -fC/cfe in G is at least (1 + o{l))n'^^p^^ . Recall that the /c-codegree of a set U of k distinct 
vertices is the number of vertices that are adjacent to all of U. Let us say that a copy of Kk^k is 
deficient if either of the sides of its bipartition has /c-codegree less than ^np^ in G. We claim that at 
most ^ + 0(1) of the copies of K^^k are deficient. 
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To see this, note that if an ordered /c-tuple of distinct vertices has fc-codegree less than ^np^, then 
it can extend to at most (^np'^)'^ copies of K^^k- The number of such /c-tuples is at most n^; therefore, 
the number of deficient copies of Kk^k is at most n^(^^np^^^ < ^n'^^p'^^ , as claimed. 

Yet each non-deficient copy of Kk^k extends to at least 

f^np'' - 2k\ „ /^np^ -2k -I 

copies of H. This is because we may consider the copy of Kk^k to be V1UV2, we choose V3 from the 
common neighborhood of V2 excluding the 2k vertices in Vi DV2, and finally we choose V4 from the 
common neighborhood of Vi excluding the 2k + I vertices in Vi U V2 U V3. Since we assumed that 
p ^ n^^/*^, the binomial coefficients are asymptotically monomials of degree /, so we conclude that 
each non-deficient copy of Kk^k extends to Vl{{np^ f ■ (np^'Y) = r2(n^'p^'^') copies of H. Since there are 
always at least (| -|- o{l))n^^p^^ non-deficient copies of Kk^k, we conclude that the number of copies 
of H is always ft{n'^''~^'^'-p^^~^'^'''-), as claimed. □ 

Using Lemma |7. II as a building block, we now prove the lemma that provides the first component 
of the upper bound when t is odd. Actually, we prove a result for G2m instead of Gm, but this does 
not matter for the purpose of the general argument. 

Lemma 7.2. Let k be a positive integer. Let H he a 6-partite graph with parts {Vi}\ such that V3 
and V4 are singletons, and the other four parts each have size k. Let the edges of H be as follows: the 
two pairs {Vi, V2) and (V5, Vq) are each complete bipartite graphs, the vertex in V3 is adjacent to all of 
V2DV4UVQ, and the vertex in V4 is adjacent to all ofVi U V3 U V5. There are no more edges. 

Consider G2m, the graph after the 2m-th round of the Achlioptas process with parameter r > 2. 
Let p = 2m/n^, and suppose that p » n^^ with —9 > —l/{k + i). Then G2m contains Q{n"^^^p^^^^) 
copies of H whp. 

Proof. Let Hi be the subgraph of H induced by Vi U V2 U V3 U V4, and let Hq be the subgraph of 
Hi with the edge between V3 and V4 deleted. Observe that we can find a copy of i7 in a graph by 
first looking for a pair of vertices for the site of the edge between V3 and V4, and then looking for two 
disjoint copies of Hq that are extended into Hi by that pair. 

Consider the (i + l)-st turn, for some m < i < 2m. By Lemma 17.11 Gm (and hence Gi D Gm) 
always contains ^(n'^^^'^p''^'^^^) copies of Hq. Lemma [B . 3 1 verifies that (-f^O) Hi) is a balanced extension 
pair, and ji^^p^^'^'^^ is a positive power of n because we assumed that p ^ with —0 > —l/{k + |) 
and A: > 1. Thus, Theorem I2.8f i) establishes that wep, every pair of vertices in Gi C G2m extends 
Q^^2kpk'^+2k^ copies of Hq into Hi. Call this event Ai, and condition on it. 

For a pair of vertices {o, b}, let Ua^b be the number of copies of Hq that the pair {a, b} extends into 
Hi. Recall that this definition does not depend on the presence of an edge between a and b. Let us 
estimate the average value of Ua^b over all pairs. Since Hq differs from Hi at exactly one edge, each 
copy of Hq has a pair at which it contributes -|-1 to the sum Yl''^a,b- Therefore, averaging over all (2) 
pairs of vertices, we obtain that the average number of copies of Hq that are extended to Hi at any 
pair is Q(n^'^+^p'^^+^^). 

On the other hand, by our conditioning, every pair of vertices in Gi extends 0{n'^^p^ ^^^) copies 
of Hq into Hi. Therefore, at least a constant fraction 7 = ^^(1) of all (2) pairs have the property of 
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extending Q.{'n?^p^ +'ik'^ copies of Hq into Hi. Let P be the set of all such pairs. Regardless of the 
choice of strategy, if all r incoming edges span pairs in P, we will be forced to choose a pair in P. 
This will create r2((n^^p'^^"'"^'^)^) = Q.{n^^p'^^^^'^'^) pairs of copies of Hq that are extended to Hi by 
the chosen pair. Such a pair of copies of Hq would become a new copy of H after the edge is added, 
if the pair of copies were disjoint. If the pair of copies of Hq is not disjoint, then let us say that they 
create a degenerate copy of H. For now, let us count degenerate copies of H along with the true copies 
of H. Later, we will show that the degenerate copies are vastly outnumbered by true copies of H. 

Since i = o(n^) = o(|P|) and incoming edges are uniformly distributed over the (2) — i = (1 + 
0(1)) (2) unoccupied pairs, we conclude that the probability that all incoming edges span pairs in P 
is g > (1 + o(l))7^ = r2(l). Let i run from m to 2m. Then wep, the number of (possibly degenerate) 
copies of H in G2m is at least Bin(m, q) ■ nin'^^p'^^^^'^^). By the Chernoff bound, the binomial factor 
exceeds mq/2 = Q.{'n?p) wep, so we conclude that G2m has Vl{Tn?p ■ ■fi'^kp^^^'^'^^) = Qin"^^^ p^^^^) 
(possibly degenerate) copies of H vi^hp. 

To finish the proof of this lemma, we must show that the number of degenerate copies of H in G2m 
is o{n^^^^p^^^^) whp. For convenience, we will work with G{n,p) instead of G2m because Lemma 
12.41 shows that we may couple G2m with G{n,Arp), and the constant 4r disappears under the "o(-)" 
notation. Note that the underlying graph of a degenerate copy of is a superposition of two copies 
of Kk+i^k+i-, overlapping on at least 3 vertices. So, let us consider any such superposition, and call 
the underlying graph F. Let v' = v{F) and e' = e{F). The copies overlap on at least 3 vertices, so 
v' < v{H). It is easy to check that since -ft^/t+i^fc+i is a balanced graph, ^ > So, the expected 

number of copies of F in G{n,p) is: 

E < n-'p^' = (np^'/^'y' < {np'^^y-^^Y = (n-(^)p^(^))-'/«(H)_ 

Now, we assumed that p » n"^/('^+5), so n^Wp^'^^') > 1 because v{H) = 4k + 2 and e{H) = 2P + 
4k + 1. Furthermore, v' < v{H), so Markov's inequality implies that whp, G{n,p) has o{n'"^^^p'^^^^) 
copies of F. Since each copy of F can account for at most a constant number (depending only on k) 
of degenerate copies of H, and there is only a constant number of non-isomorphic superpositions F, 
we conclude that whp, G{n,p) has o{n'"^^^p^^^^) degenerate copies of H. This completes the proof of 
the lemma. □ 

8 Avoiding K4 in the Achlioptas process with parameter 3 

To apply the machinery of Section \5\ one needs to prove that certain quantities are positive or negative 
powers of n. In our study of avoiding cycles, cliques, and complete bipartite graphs, the only case in 
which we encounter a key exponent that is not separated from zero is when we are avoiding K4 in the 
Achlioptas process with parameter 3. 

However, the separation of the exponent from zero was merely a convenience which allowed us to 
bound maxima of families of random variables (e.g., the maximum codegree in a graph) whp. When 
we do not have this condition, we may instead bound the entire distribution of the family. 

Lemma 8.1. Let n~^^^ ^ p <C n^^^^ logn. Then G{n,p) satisfies the following property whp: all 
codegrees are at most np^ log n, and for every integer 4 < k < log n, the number of pairs with codegree 
at least knp^ is at most r? j)^ . 
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This result, which we prove at the end of this section, ahows us to prove our final threshold. 

Theorem. The threshold for avoiding in the Achlioptas process with parameter 3 is r?!'^ . 

Proof. Lower bound: A shortsighted strategy works in this instance: arbitrarily select any one of 
the incoming edges that does not create a copy of -R'4. Let m <C n^/^, and let p = 2m/n? . Again, we 
assume without loss of generality that m ^ v?!'^ j log n. Note that n~^l'^ j log n <^ p <^ n^^^"^. We will 
analyze the performance of our strategy by proving two successive claims: 

(i) Gm has O(n^p^) copies of K4 \ e wep. 

(ii) The probability of failure in m rounds is o(l). 

The interested reader may check that if we followed the recipe for avoiding Kt in Section HI we 
would start by counting copies of K4 \ 2e instead of K4 \ e. This is essentially the only change in the 
lower bound argument, but we provide the details below for completeness. 

For (i), \ e is balanced and n^p^ is a positive power of n, so Theorem 12.61 implies that the 
number of copies of \ e in Gm is 0{n'^p^) wep. 

For (ii), consider the probability that the strategy fails at the {i + l)-st round for some i < m, i.e., 
that all 3 incoming edges span pairs that complete copies of K/j^. The number of such pairs is upper 
bounded by the number of copies of \ e. Since Gi C Gm, claim (i) implies that Gi has O(n^p^) 
copies of \ e wep. Call this event Ai, and condition on it. Then, the chance that all 3 incoming 
edges complete is = 0{n^p^^). Letting i run through all m = n?p/2 rounds, a union 

bound shows that the probability that we are forced to complete a copy of K4 by the m-th round is 
P < 0{n'^p ■ nV^) + E IP bM = 0{n^p^^) + o(l) = o(l), as desired. 

Upper bound: Let m ^ n'^/^, and let p = 2m/n^. We will show that whp, any strategy fails within 
3m rounds, which we break into periods of length m. Again, we may assume that m <^ r?^'^ log n 
without loss of generality. Note that <C p ^ n^^/^logn. Our result follows from the following 

three claims: 

(i) Gm contains Vl(r?^ pairs of vertices with codegree at least 2 whp. 

(ii) G2m contains i7(n^p) copies of K/^ \ e whp, and with probability 1 — o(n~^), every pair of vertices 

in extends 0(1) copies of K\ \ e into K\ 

(iii) The probability of survival through 3m rounds is o(l). 

Proof of (i). In the random graph, the expected codegree is roughly n-p?' ^ 2, but since we do not 
know how far p exceeds n"^/^, we need a slightly more careful argument. Let S be the sum of the 
codegrees E{tj v} '^('"' ^) o'^^^ ^ unordered pairs {n, w}, and let us decompose S = + 52 + 5*3, where 
5*1 is the contribution from summands with d{u,v) E {0, 1}, S2 is the contribution from summands 
with 2 < d{u,v) < Anp^, and 5*3 is the remainder. We aim to show that ^2 = r2(n^p^), which will 
imply the result. 

By double-counting, 5 = ^^2}^ > where d{v) is the degree of vertex v. By convexity, this is always 
at least ^(2), where d is the average degree. Since Gm has exactly m edges, d = 2m/n = up ^ 1. 
Therefore, S > (0.5 + o(l))n(np)2. 
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On the other hand, Lemma 18.11 shows that whp, Gm has the property that ah codegrees are at 
most np^logn, and for every integer 4 < /c < fogn, the number of pairs with codegree at least knp^ 
is at most ra^/fc^. Conditioning on this, we may then bound ^3, the sum of codegrees which exceed 
4np^, by: 

^3 < 
< 

< 
< 

Also, S*!, the sum of codegrees which are in {0, 1}, is trivially at most (2) ^ n^p^ since we assumed 
p ^ n^^/2_ 5*2, the sum of codegrees between 2 and 4np^, is at least S2 = S — Si — > O.OSn^p^. 
Therefore, whp the number of pairs with codegree at least 2 is at least ^.{)biv'p'^ / {-inp^) = Q.{v?), as 
claimed. □ 

Proof of (ii). The second part follows from Theorem I2.8( ii) because (i^4 \ e^K^) is a balanced 
extension pair and n^p^ is a negative power of n. Let us now concentrate on the first part. Conditioning 
on the high probability event in claim (i), we may now assume that in Gm, the proportion of pairs with 
codegree at least 2 is some 7 = Consider the {i + l)-st round, where m < i < 2m. Regardless 

of the choice of strategy, if all three incoming edges span pairs that each have codegree at least 2, 
then we will be forced to create a new copy of K/^^ \ e. Incoming edges are uniformly distributed over 
unoccupied pairs, and the number of occupied pairs in Gi is exactly i = o{n'^). So, since Gj D Gm, the 
probability that all three incoming edges span pairs with codegree at least 2 is > (l + o(l))7^ = Q(l). 

Let i run from m to 2m. Then, the number of copies of -fC4 \ e in G2m is at least Bin(m, g). By 
the Chernoff bound, this exceeds mq/2 = Q{n'^p) wep, so we are done. □ 

Proof of (iii). Consider the (i + l)-st round, where 2m < i < 3m. Regardless of the choice of 
strategy, if all three incoming edges span pairs that complete copies of K4, we will lose. We can lower 
bound the number of such pairs by ^l{n'^p) by conditioning on the following events. Let A be the event 
that G2m contains ^{n'^p) copies of K4 \ e, which occurs whp by (ii). Also by (ii), with probability 
1 — o(n~2)^ every pair of vertices in Gi C Gsm extends 0(1) copies of \ e into K^; call this event 
Bi. 

Even after conditioning, the incoming edges in this round are still independently and uniformly 
distributed over the (2) ~^ = 0(n2) unoccupied pairs of Gi. Therefore, the probability that both pairs 
complete K4, conditioned on survival through the i-th. round, is pi = il((^)'^) = Q(p^). Letting i 
run from 2m to 3m, we see that the probability that any strategy can survive for 3m rounds is at 
most 

P < F[^A] + Y,^hBi] + '[lil-pi)<o{l)+exp{-J2Pi} 
< 0(1) + exp I - Q{n'^p • /)} = 0(1) + e-'^(^) = o(l). 
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which completes the proof. □ 

It remains to estabhsh Lemma |8.H which we used to control the distribution of codegrees in claim 
(i) of the upper bound. 

Proof of Lemma 18.11 Each codegree is distributed as Bin(n — 2,p^), so a union bound shows that 
the probability that some codegree exceeds np^ log n is at most 

P<n2.f J" \ {p^rp' <n^-( ""^f \ = o(l). 



^np^ log nj \ np-^ log n ^ 

Next, fix any 4 < A; < logn, and let X be the number of pairs with codegree at least knp^ . Consider 
an arbitrary vertex v, and let be the number of vertices u ^ v such that the codegree of {v, u} is 
at least knp^ . Note that X = ^^'^Z ■ 

Since d{v) is binomially distributed Bin[n — and np is a positive power of n, the degree d{v) 
is at most l.lnp wep by Chernoff. Condition on this, and condition further on a neighborhood N{v) 
of size d{v). For each w N{v) U {v}, define the indicator random variable ly^ to be 1 if and only if 
the codegree of {v,w} is at least knp"^, or equivalently, if w has at least knp"^ neighbors in N{v). Note 
that because we already fixed N{v), these are independent since they are determined by disjoint 
sets of edges. Yet k > A and np^ » 1, so each has probability 

^ [ — {^f^fip'i^^ — knp'^ ) ~ i^k) ^ 

Therefore, X^ is stochastically dominated by d{v) + Bin[n — 1 — d{v), q]. Since k < logn, a Chernoff 
bound implies that wep, X^ < l.lnp + 2nq = o(n/k^), which gives X = ^J2^v — o{n?/k^). The 
result follows by taking a union bound over all v and 4 < A; < logn. □ 

9 Concluding remarks 

• Although our theorems treat specific graphs (cycles, cliques, and complete bipartite graphs), we 
conjecture that the thresholds for avoiding general graphs H follow from the natural generaliza- 
tion of the recipe that we used. 

To apply our machinery from Section [5l the first thing that we needed to specify was the 
parameter s. This was the number of levels of danger considered by the avoidance strategy in 
the proof of the lower bound. The correct choice of s then determined 0, the negative exponent 
in the threshold (in terms of p) for avoidance: 

Furthermore, it is clear that the threshold for avoiding any fixed subgraph H' C H is a lower 
bound for the threshold for avoiding H itself. This is because any strategy that avoids H' will 
certainly avoid H as well. 

In light of this, we conjecture that the threshold for avoiding H in the Achlioptas process 
with parameter r is n^~^ , where 0* is the minimum value of 6{H',r,s) when s runs over all 
nonnegative integers and H' runs over all subgraphs of H. 
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• Just as in the case of analyzing the Achhoptas process for giant component avoidance [6], one 
can also consider the offline version of the fixed subgraph avoidance problem. In this offline 
version, all random r-tuples of edges arriving during the process are accessible to an algorithm, 
and it can make its choices at each round, relying on the perfect knowledge of the past and 
the future. The question is still how long the algorithm can typically avoid the appearance of a 
copy of a fixed graph H. We expect that in most of the cases there will be a sizable difference 
between the online and the offline thresholds. Here is a sketch of the illustrative case oi H = K3, 
r = 2. For this case we can prove that if m = o(n^/^), then one can whp avoid a copy of 
during the first m rounds in the offline version. This should be compared to the threshold of 
m = v^l'^ for the online version, given by Theorem II. li The argument proceeds as follows. Set 
p = 2m /Tn?. The offline model in this case can be approximated quite accurately by generating 
a random graph G according to the distribution G{n,2m), and then splitting the edges of G 
randomly into m pairs: (ei, /i), . . . , (em, fm)- Denote the above random matching of E{G) by 
TT. We use the following strategy, while processing the pairs {ei, fi): in each pair (ei, fi) choose 
an arbitrary edge not participating in any triangle in G, otherwise pick an arbitrary edge. It is 
obvious that using this strategy we can only lose (i.e. create a triangle) if G contains a triangle 
with edges xi,X2,X3 such that their respective pairings in vr also belong to triangles in G. The 
number of triangles in G is whp of order n'^p^, and therefore the probability of having a triangle 
whose three edges are paired in vr with edges from triangles is at most of order 



It would be very interesting to obtain tight results for the offline small subgraph avoidance 
version of the Achlioptas process for a wide variety of graphs Ji and parameter r. 

• The appearances of the giant component and of a fixed graph are just two instances that have 
been addressed so far in the context of the Achlioptas process. Naturally, one can consider other 
graph theoretic properties as well in this context. We hope to return to questions of this type 
in the future. 

Acknowledgment. The authors would like to thank the referee for suggestions that improved the 
exposition of this paper. 
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A Supporting results for avoiding Kt 

In this section, we collect the supporting lemmas and inequalities that are used to prove thresholds for 
avoiding Kf. We will make claims about properties of simple single- variable functions without proof, 
as they can be verified by routine analytical methods. 

Throughout the appendix, we will set s = [log^[(r — l)t -\- 1]J. We begin by proving some basic 
facts about s. 

Lemma A.l. For fixed t>S, the parameter s is decreasing in r in the range r > 2. 
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Proof. Let t be fixed, and consider the function /(r) = [^^l ^ ■ It suffices to sliow that < 0. 
Calculation yields: 

dl _ (logQ (;-^-log[(^- 1)^ + 1] f 

dr log^ r 

We will show that the numerator is negative. Since t > 3 and r > 2, (r — l)t + 1 > 3r — 2 > 2r, so the 
numerator is less than 

numerator < (logr) (log2r) — 




Now is a decreasing function of r when r > 2, so it is at most log 2; therefore, the entire expression 
is always negative, and / is indeed decreasing in r. □ 



Lemma A. 2. If t > 4 and r > 2, then s < t/2. Furthermore, if t > 5 and r > 2, or if t = A and 
r > 4, then s < t/2. 

Proof. By Lemma lA. 11 if r > 2, then s < [log2(t + 1)J , and one may verify that this is in turn < t/2 
for all t > 4, and < t/2 for t > 5. For the other range, when r > 4, Lemma fA. II gives s < [log4(3t + l)J, 
which is less than t/2 at t = 4. This finishes the lemma. □ 

A.l Balanced graphs and extensions 

Lemma A. 3. For any t > 4 and r > 2, Kt\se is a balanced graph. 

Proof. We must show that the edge density (number of edges divided by number of vertices) of 
Kt\se is at least as large as the edge density of any of its proper induced subgraphs. The edge density 
of Kt \ se is exactly [(2) — s\/t. Lemma [A. 21 established that s < t/2, so the edge density is at least 
[(2) ~ ~ C2^)/(^ ~ ■*-)• final quantity is precisely the edge density of Kt-i, which is an 

upper bound on the edge density of any proper induced subgraph of any t-vertex graph, so we are 
done. □ 



Lemma A. 4. For any t > 4 and r >2, Kt\se has the balanced extension property. 

Proof. Fix any graph G of the form Kt \ se, and let u, v be any two nonadjacent vertices of G. We 
must show that the function e{H)/{v{H) — 2) is maximal at H = G, where H is allowed to range over 
all induced subgraphs of G that contain {u,v}. For any graph H with n vertices that is missing at 
least one edge (e.g., the edge between {u,v}), e{H)/{v{H) - 2) < [(2) - l]/(n - 2) = (n + l)/2. For 
any proper induced subgraph H C G, we then have e{H)/{v{H) — 2) < t/2. 

Yet e{G)/{v{G) - 2) = [(*) - s]/(t - 2), and Lemma [Aj established that s < t/2. Using this 
bound for s, we see that e{G)/{v{G) - 2) > [(*) - |]/(t - 2) = t/2, which matches our upper bound 
for e{H)/{v{H) — 2), so we are done. □ 
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Lemma A. 5. Suppose that t > 4. Let Hi = K^t^ ^±-^, and arbitrarily choose the rest of the se- 
quence {if2,-f^3, • • • ,Hf}, where Hf = Kt, by adding one edge at a time. Then every consecutive pair 
is a balanced extension pair. 

Proof. Consider a consecutive pair (-ff^, H^^i). By the construction, contains a complete bipartite 
subgraph that was Hi; let Vi U V2 be the corresponding partition of the vertex set. Let u and v be the 
endpoints of the edge on which H^ and H^+i differ. Without loss of generality, suppose that u,v ^Vi. 
(They must lie in the same part because H^ already contains all edges between Vi and V2.) Now, 
consider any subsets Ui C Vi and U2 C V2 such that u,v eUi and C/i U C/2 7^ ^1 U V2. Let Hj^ be the 
subgraph of Hk induced by Ui U C/2. It suffices to show that e{Hk)/{v{Hk) - 2) > e{H'j.) / {v{H'j.) - 2). 

Let us denote ui = |C/i|, U2 = \U2\, and let ei and 62 be the respective numbers of edges of Hk 
spanned by Ui and by C/2. Since the number of edges between Vi and V2 is |_|J |"|] = \J^\ , the number 
of edges in Hj^ is at least ei + 62 + [^J • On the other hand, the number of edges in H'j^ is precisely 
ei + 62 + u\U2. Thus, it suffices to show that: 

ei + 62 + |_^J ^ ei + 62 + U1U2 
t — 2 ~ ui + U2 — 2 

Since the quantity (61+62) appears in both numerators but the denominators satisfy t — 2 > ui+U2 — 2, 
it will only strengthen the inequality to increase (61 + 62) to its maximum possible value, (^2^) + (^2^) — 1 
(we must subtract 1 because we assumed that H^ has no edge between u,v G Ui). Hence it suffices to 
show 

t — 2 ui + U2 — 2 

Let X = ui+ 7/2 • The right hand side then simplifies to {x + l)/2. Also, ("2') + ("2") > (2) - [xJ = 
\\ — f 1 • Therefore, it suffices to show the following inequality, which is equivalent to the next two 
inequalities: 

x + 1 
2 

{x+m-2) 
2 

0. 

The expression under the ceiling sign is a quadratic polynomial in .t, so it is minimized at x = t — 1, 
where it takes the value [^J — ^ — \- Since only and 1 are quadratic residues modulo 4, this is 
always at least —1/2, so we are done. □ 



X 

T 



r*-fl-i + L$J 



2 

■ 1 + 



:(1-*) 



t 
2 



> 



> 



> 



A. 2 Inequalities 

For the reader's convenience, we reproduce the definitions of the parameters s and 0: 

r\t-2) + 2 



s= Llog,[(r-l)t+l]J, 



[(*)-.] + 



r-1 

-1 



Inequality A. 6. Suppose that either t>5 and r >2, or t = A and r > 4. Then —6>— [|J 
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Proof. It suffices to prove tlie stronger inequality that ■ ^ < 1. Since tlie denominator of 9 is 
positive, this is equivalent to 



t 



r'{t-2)+t< r' 



+ 



Rearranging terms, this is equivalent to 



t<r''{i-s\ + 



(1) 



Now by the definition of s = [logj,[(r — l)t + 1]J, we must have r^^^ > (r — l)t + 1. Since r, s, and t 
are integers, this implies r^~^^ > {r — l)t + 2, from which we deduce (using r > 2): 



_(r-l)t + 2 ^ *-2^, t-2 t + 2 
> =t > t = . 



(2) 



Since r and s are integers, we in fact have 



Also, since s is always > 1 for t > 1, 



't + 2' 



> 1. 



(3) 
(4) 



Let us now split into several cases, depending on the (integral) values of t and r. 



Case 1: t > 10. Let us bound the terms on the right hand side of inequality ([T]) one at a time. By 
> t/2, and by ^^tty > 0. Also, Lemma lA. II showed that s was decreasing in r, so for 
r > 2, we have s < [log2(t + 1)J . One may routinely verify that [log2(t + 1)J < | — 2 for t > 10. 
Combining all of these bounds, we obtain 



1 



> 



(2)+0 = t, 



as desired. 



Case 2: t > 5 and r > 6. We follow the same method as the previous case. If we use r > 6 in the 
derivation of inequality ([2]), we obtain 



t-2 t-2 5t + 2 5 

>t > t = > -t. 

r 6 6 6 



By (jl]), TTTY" > 0. Also, by Lemma [A. 11 if r > 6, then s < [logQ(5t + 1)J. One may routinely 
verify that [log^iBt + 1)J < f - f for t > 5. Comb ining all of these bounds, we obtain 



s + 



> 



+ = t, 



as desired. 
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Case 3: t = 4 and r > 4. By Lemma fA.li if r > 4, then s < [log4(3t + 1)J. Since we have t = 4, this 
means that s < 1, and in particular, | — s > 1. Yet inequahty bounds > \{t + 2)/2] = 3 
and inequahty dH) bounds ^^fy- > 1, so combining ah of these bounds, we obtain 



t \ -1 

s H 

2 ; r - 1 



>3(l) + l = 4 = t, 



as desired. 

Case 4: remainder. Only finitely many instances remain: when the integers t and r satisfy 5 < t <9 



and 2 < r < 5. A routine check verifies that • | < 1 in all 20 of these cases. 
This completes the proof. 



□ 



Inequality A. 7. For any t > 4 and r > 2, if n , then n p 



t-2jl)-s-l 



is a positive power of n. 



Proof. Clearing the denominator of 9, we see that we must establish the following inequality: 



(t-2) 

The right hand side is equal to 











r' 




+ — r 

r — 1 


> 



s - 1 



r{t - 2) + 2] . 



s - 1 



r{t-2)+2] 



so our initial inequality is equivalent to 

— 1 

(t-2) -> 

r — 1 



r\t-2) + 



2-r\t-2) -2, 



2-r'(t-2) -2, 



so by rearranging terms and ignoring the s term appearing in the bracket on the right hand side, it 
suffices to establish 

it -2) 



> 2 - 2. 
r-1 - \2; 



Dividing through by t — 2, we see that it suffices to show that 



>t + l. 



r - 1 

Since both sides of this inequality are integers, it is equivalent to 

> t. 



r — 1 



Yet by definition of s = Llog^[(r — l)t + 1]J, we must have r^^-^ > (r — l)t + 1 ^ 
completes the proof. 



> t, which 
□ 



Inequality A. 8. Suppose that t > 5 and r > 2, or t = 4 and r > 4. If p n ^ , then n* ^^(a) ^ is a 
negative power of n. 
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Proof. Clearing the denominator of 6, we see that we must establish the following inequality: 



{t-2) 



+ 



- 1 
r — 1 



< 



r{t-2)+2]. 



This is equivalent to 



— 1 

(t-2) < 

r — 1 



2. 



By definition of s = [logj.[(r — l)t + 1]J , we must have r" < [r — l)t + 1. Therefore, the left hand side 
is at most (t — 2)t, so it suffices to show that 



-2t <f - t- 2s. 



But this final inequality is a consequence of Lemma \A.2\ which established that s < t/2. 



□ 



B Supporting results for avoiding Kf^t 

Coincidentally, the definition of the parameter s is exactly the same for avoiding Kt and avoiding Kt^t, 
so we can still use Lemmas lA.ll and lA. 21 (which prove properties of s) in this section. The specification 
of 9 will be different, however. For the reader's convenience, we reproduce the definitions here. 



s= Llog,[(r-l)t + l]J, 



r*(2t-2) + 2 

r-1 



r^{t'^ - s) + 



B.l Balanced graphs 



Lemma B.l. For any t > 3 and r > 2, Kt^t \ se is a balanced graph, except in the case when t = 3, 
r = 2, and the graph is i^2,3 with a pendant edge. In that final case, if p ^ n~^^/^^/ log n, the number 
of copies of that graph in Gm is still 0{n^p'^) wep. 

Proof. We must show that the edge density (number of edges divided by number of vertices) of 
Kt^t \ se is at least the edge density of any proper induced subgraph. The edge density of the complete 
bipartite graph Ka^b is ab/{a + b), which is increasing in both a and b, so the edge density of any 
proper induced subgraph of Kt^t \ se is at most t{t — l)/(2t — 1). On the other hand, the edge density 
of Kt^t \ se is precisely (t^ — s)/{2t), so we must show that 

t{t - 1) - s 
2t-l - 2t 

Clearing the denominators, this is equivalent to 

2t^ - 2t^ < 2t^ -t^ - s{2t - 1). 



Rearranging terms, this is equivalent to 



s < 



2t-l 



Now if t > 4, Lemma lA.21 bounds s < t/2, which finishes the inequality. 
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The only remaining case is t = 3. However, Lemma lA.ll established that the dependence of 
s = [log^[(r — l)t + 1]J on r was decreasing, so s = 1 for r > 3, and s = 2 for r = 2. One may manually 
verify that of all of the graphs of the form K^^^, \ e and i^s^s \ 2e, the only one which is not balanced 
is the deletion from K^^^ of two edges incident to the same vertex, which is i^2,3 with a pendant edge, 
as claimed. Since that graph, which we denote K2^3 + e, arises only when s = 2, this happens only 
when r = 2. 

Now let us bound the number of copies of that graph in G{n,p), when p ^ / log n. In the case 
t = 3, r = 2, we have 9 = — and so n^p^, roughly the expected number of copies of i<'2,3 in the 
random graph, is a positive power of n. So, since -fC2,3 is balanced, Theorem 12.61 bounds the number 
of copies of K2^3 in Gm by 0{n^p^) wep. Also, np is a positive power of n, so we may bound all 
degrees by 2np wep. If both situations hold, we may conclude that the number of copies of K2^z + e 
is 0{n^p^ ■ np) = 0{n^p'^), as desired. □ 



Lemma B.2. For any t > 3 and r > 2, Kf^t \ se has the balanced extension property. 

Proof. Fix any graph G of the form Kt^t \ se, and let n, v be any two nonadjacent vertices of G. We 
must show that the function e{H)/{v{H) — 2) is maximized at H = G, where H is allowed to range 
over all proper induced subgraphs of G that contain {u,v}. Any such H is still bipartite with respect 
to G's bipartition; suppose that it has a vertices on one side and b on the other. Since we assumed that 
is missing at least the edge joining {n,w}, we must have e(ff)/(v(if) — 2) < {ab—l)/{a + b — 2). This 
is increasing in both a and b, so its maximum over proper induced subgraphs H is — 1) — l]/(2t — 3). 
So we must prove that 

t"^ -s ^ i(i - 1) - 1 
2t - 2 ~ 2t - 3 

Clearing the denominators and expanding brackets, this is equivalent to 

2t^ - 3t^ - s{2t - 3) > 2t^ - 4t^ + 2. 
Rearranging terms, this is equivalent to 

t^ - 2> s{2t - 3). 

If t > 4, then Lemma lA.21 implies that s < t/2, which implies the above inequality. On the other 
hand, if t = 3, then Lemma lA.ll established that s < [log2(t + 1)J for r > 2, so we must have s < 2, 
and this also implies the above inequality. Therefore, we are done. □ 



Lemma B.3. For any fixed positive integer k, consider the following 4-p(ii"tite graph, which we call 
Hi. Let the parts be ^4; V2, V3, V4, with \Vi\ = IV2I = k and IV3I = IV4I = 1, and place edges such that 
(^1,^2), (Vi,V4), and (V3,V2) are complete bipartite. There are no more edges. Let LL2 be obtained 
from Lii by adding the edge between V3 and V4. Then {Hi,H2) is a balanced extension pair. 

Proof. Consider any subsets Ui C Vi and U2 C V2, and let L[[ be the subgraph of LLi induced 
by U1UU2UV3U V4. We must show that e{H[)/{v{H[) - 2) < e{Hi)/{v{Hi) - 2). Let a = \Ui\ 
and b = iC/ol- Then, fj^}^ r, = °^+°+^ = -2^ -|- 1 which is increasing in both a and b. Therefore, 

I ^1 ' v{H[)—2 a+b a+b ' ^ ' 

^^i) < k^+k+k _ e(Hi) , 1 |-, 

v{H[)-2 - k+k - v{Hi)-2' ^^"^ ^ 



32 



Lemma B.4. Suppose that t is even and at least 4. Let Hi be the ^-partite graph with parts 
^2,^3,1/4, each of size t/2, and edges such that (Vi,V2), iVi^Vi), and (V3,V2) CLre complete bipar- 
tite. Let {H2, . . . , -ffi+(t/2)} be obtained by successively adding single edges until i?i+(t/2) has a perfect 
matching between V3 and V4. Then, arbitrarily choose the rest of the sequence {-f^2+(t/2)) • • • j-^/} 
adding one edge at a time, until the final term is the complete bipartite graph Kt^t with bipartition 
(Vi U V3, V2 U V4). Then every consecutive pair (Hi^, H/^^i) is a balanced extension pair. 

The proof breaks into two cases, since there are two stages of edge addition. 

Proof of Lemma IB. 41 for k < t/2. Consider a consecutive pair (ff/j, if^+i). By the construction, 
Hk has the following structure. The vertex set is partitioned into Vi U V2 U V3 U V4, with all parts 
of size t/2. The pairs (Vi,V2), {Vi,V4), and (V3,V2) are complete bipartite graphs, and there is a 
(k — l)-edge matching between V3 and V4. There are no other edges. Also, there is a pair of vertices 
u G V3, w € V4, not involved in the {k — l)-edge matching, at which the addition of an edge creates 
Hk+i. Now consider any family of subsets Ui C Vi such that u £ U3 and v £ 11^. Let H'j, be the 
subgraph of Hk induced by UC/,. We must show that e{H'^) / {v{H'^) - 2) < e{Hk)/{v{Hk) - 2). 

For brevity, let a = \Ui\, b = \U2\, c = IC/3I, and d = {U^l. Since the edges between and form 
a matching of at most k — 1 edges which does not involve u £ U3 or v £ U4, there can be at most 
minjc — 1, d — 1, A; — 1} = min{c, d,k} — 1 edges there. Therefore, 

e{H'j^) ^ ab + ad + cb + (min{c, d, k} — 1) 
v{H'^) - 2 - a + b + c + d-2 ' 

Let us simplify the bound by removing the variables a and b from the inequality. To do this, we show 
that the bound is nondecreasing in both a and b; then, we may replace them with their maximum 
value t/2. Calculating the partial derivative of the right hand side: 

d _ {a + b + c + d-2){b + d) - {ab + ad + cb+{mm{c,d,k} -I)) 
da ~ (a + 6 + c + (i-2)2 

numerator = {b + d — 1)^ + cd — minjc, d, k} 
> cd — c, 

which is always nonnegative because v £ U4 ^ d > 1. An analogous result holds for ^ by symmetry. 
Hence we may substitute a = 6 = t/2to obtain the bound 

e{H'^) ^ ^ + |(c + d) + minjc, d,k} -1 _ t_ + t + minjc, d, fc} - 1 

v{H'^) - 2 - t + {c + d)-2 ~ 2^ t + {c + d)-2 ' ^' 

We reserve the case k = t/2 for separate treatment at the end. 

Case \: k < t/2. Using the bound minjc, d,k} < k in ([5]), we obtain: 

ejH',) ^ t -$+t + fc-l 
v{H'^)-2-2 t + {c + d)-2 

Note that if we substitute c = d = t/2 (their maximum values) in this bound, we obtain precisely 
e{Hk) / {v{Hp/) — 2). Therefore, it suffices to show that the bound is nondecreasing in c and d. 
We accomplish this by showing that the numerator of the large fraction is always < 0. 
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We assumed in this case that k < t/2, but in fact, since k is an integer and t is even, we actually 
have k < | — 1- Substituting this into the numerator: 

-j + t + k-l<-j + t+(^l-l^-l = -\[it-3f-l]. 

Since we assumed that t > 4, this is always < 0, so the bound is indeed nondecreasing in c and 
d. This finishes this case. 

Case 2: k = t/2. This time, we bound mm{c,d,k} < [c + d)/2 in Letting x = (c + (i)/2, we have 

v{H'^) -2-2 t + 2x-2 

Note that if we substitute x = t/2 (its maximum value) in the bound, we obtain precisely 
e{Hk) / {v{Hk) — 2). Therefore, it suffices to show that the bound is increasing in x. Taking the 
numerator of the partial derivative of the fraction involving x: 

d f t^ \ t 

numerator oi — = (t + 2x - 2) - [ — - + t + a:-12=-(t-2). 
ox \ 4 / 2 

Since we assumed i > 4, this is always positive, so the bound is indeed increasing in x. This 
completes the proof. □ 



Proof of Lemma IB. 41 for k > t/2. Consider a consecutive pair {Hk, Hk+i). By construction, 
has the following structure. The vertex set is partitioned into Vi U V2 U V3 U V4, where each part has 
size t/2. The pairs {Vi, V2), {Vi, V4), and (V3, V2) are complete bipartite graphs, and there is a perfect 
matching between V3 and V4. There may be some more edges as well between V3 and V4, but there 
are two vertices u € and u G V4 such that there is no edge between u and v. There are no more 
edges in the entire graph. Also, -fffc+i is obtained by Hf^ by the addition of just the edge joining u and 
V. Now, consider any family of subsets Ui C Vi such that u & U3 and v G U4. Let H'f, be the subgraph 
of Hk induced by UUi. We must show that e(F^)/(v(i?^) - 2) < e{Hk)/{v{Hk) - 2). 

For brevity, let a = \Ui\, b = \U2\, c = lU^l, and d = {U^l. Let E be the number of edges between 
Us and U4 in i^^. Then 

e{H'f,) _ ab + ad + cb + E 
v{H'j^) - 2 ~ a + b + c + d - 2' 

Next, recall that Hk contained a perfect (t/2)-edge matching between V3 and V4. The maximum 
number of edges of this matching that are included in E (i.e., go between C/3 and U^) is min{c, d} < 
(c + d)/2. Therefore, the number of edges in Hj. between V3 and V4 is at least E + ^ — Since the 
rest of Hk consists of 3 complete bipartite subgraphs Kf^2,t/2- 

ejHk) ^ 3^ + j? + |-^ 
v{Hk) - 2 - 2t-2 

and so it suffices to show that 

ab + ad + cb + E ^ 3^ + g+ f - ^ 
a + b + c + d-2 - 2i - 2 ' 
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Observe that E appears in the numerator on both sides. Since the denominator on the left hand side is 
always less than or equal to the denominator on the right hand side, we may replace E (on both sides) 
by its maximum possible value and only sharpen the inequality. This maximum is |f73||f74| — 1 = cd—1, 
because we assumed that had no edge between u £ and v G U4. Thus it suffices to show that 



ab + ad + cb+ {cd - 1) ^ 3^ + (cd - 1) + | - ^ 



c+d 



a + b + c + d-2 - 2t-2 
Note that the variables a and b appear only on the left hand side. We will show that the left hand side 
is nondecreasing in each of a and b, which will then allow us to replace each of them by their maximum 
value (t/2) and only sharpen the inequality. Let us verify this by taking the partial derivative of the 
left hand side with respect to a: 

ab + ad + cb+{cd-l) _ {a + c){b + d)-l 

a + b + c + d-2 ~ {a + c) + {b + d) -2 

d_ ^ [{a + c) + {b + d) -2]{b + d) - [{a + c)(6 + d) - 1] 

da ~ {a + b + c + d-2)^ 

numerator = {b + df - 2(5 + d) + 1, 

which is a perfect square, hence always nonnegative. Thus the left hand side was indeed nondecreasing 
in a. By symmetry, the same is true for b, and so it now suffices to prove: 

$ + i(c + d) + (cd - 1) ^ 3$ + (cd - 1) + 1 - ^ 
t + (c + d) - 2 - 2t-2 

Now let X = {c + d)/2. Then, cd < . Observe that cd appears in both the numerator of the left 
hand side and the right hand side, but the denominator on the left is at most the denominator on the 
right. Therefore, we may increase cd to x"^ on both sides, and only sharpen the inequality. Thus, it 
suffices to show that 

f + |(2x) + (x^-l) ^ 3$ + (x^-l) + i-x 
t + {2x) - 2 - 2t-2 

The left hand side simplifies: 
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+ f(2,T) + (x2-l) _ (*+x) -1 _(| + x) + l 



t + {2x)-2 2[(| + x)-l] 2 

so it suffices to show that 

(I + .t) + 1 3f + (x^-l) + |-a; 
2 - 2t-2 
Clearing the denominators by multiplying by 2t — 2, we see that it suffices to show that 

- + X + ij (t - 1) < 3- + (x^ - 1) + - - X. 

Expanding the brackets and collecting all remaining terms on one side, this is equivalent to 

Q<x^ -xt+—. 
~ 4 

Recognizing the right hand side as the perfect square (x — |)^, we are done. □ 
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Lemma B.5. Suppose that t is odd and at least 3. Let Hi be a 6-partite graph with parts {V^jf such 
that V3 and V4 are singletons, and the other four parts each have size [t/2\. Let there he edges he 
such that the two pairs (Vi, V2) and (VsjVe) are each complete bipartite graphs, let the vertex in V3 he 
adjacent to all 0/ V2 U V4 U Vg, and let the vertex in V4 he adjacent to all of Vi U V3 U V5. There are no 
more edges. 

Let {H2, . . . ,Hi^\t/2\\ be obtained by successively adding single edges until Hi^^f/2\ ^^ts a per- 
fect matching between Vi and Vg. To create the next \t/2\ graphs in the sequence, we put down a 
matching between V5 and V2, one edge at a time. Finally, arbitrarily choose the rest of the sequence 
{i?2+2[t/2j 1 • • • ) Hf} by adding one edge at a time, until the final term is the complete bipartite graph 
Kt^t with bipartition {Vi U V3 U V5, V2UV4U Vq). 

Then every consecutive pair (Hi;., H^+i) is a balanced extension pair. 

The proof breaks into three cases, since there are three stages of edge addition. 

Proof of Lemma IB. 51 for k < Lt/2j. Consider a consecutive pair (iJ^, iJfc+i). By the construction, 
-fffe has the foUowing structure. The vertex set is partitioned into {Vi}]", with IV3I = IV4I = 1 and ah 
other \Vi\ = lt/2\. The pairs {Vi,V2) and [V^jVq] are each complete bipartite graphs, the vertex in 
V3 is adjacent to all of V2 U V4 U Vg, the vertex in V4 is adjacent to all of 14 U V3 U V5, and there is a 
(k — l)-edge matching between Vi and Vg. There are no other edges. Also, there is a pair of vertices 
ti € Vi, f G Vfj, not involved in the {k — l)-edge matching, at which the addition of an edge creates 
Hf^^i. Now consider any family of subsets Ui C Vi such that u £ Ui and v £ Uq. Let H'f^ be the 
subgraph of Hk induced by Uf/j. We must show that e{H'^) / {v{H'f^) - 2) < e{Hk)/{v{Hk) - 2). 

For brevity, let a = h = IC/2I) c = If/al, d = |f74|, e = \U^\, and / = \Uq\. Since the edges 
between Ui and Uq form a matching of at most k — 1 edges which does not involve u £Ui ox v £U%, 
there can be at most min{a — 1, / — 1, A; — 1} = min{a, /, A;} — 1 edges there. Therefore, 



We will show that the bound is nondecreasing in c, which will allow us to replace c with its maximum 
value of 1. To do this, we as usual consider the numerator of the partial derivative with respect to c: 



The first bracket is a perfect square, and the second bracket is always nonnegative because a and / 
are nonnegative integers. Therefore, the bound is indeed nondecreasing in c. By symmetry, the same 
is true for d, so we can substitute c = d = \ va. the bound to obtain: 




< 



ah + ef + c{h + /) + (a + e)d + cd+ (min{a, /, k) - 1) 
a + b + c + d + e + f -2 



numerator of — 
oc 



{a + b + c + d + e + f -2){b + f + d) 

- [ab + ef + c{b + /) + (a + e)d + cd+ (min{a, /, k} - 1)] 
[{b + d+ ff -2{b + d + f) + l]+eb+ [af - min{a, /, k}]. 




< 



ah + ef + {b + f) + {a + e) + l + (minja, /, k] - 1) 
a+6+l+l+e+/-2 
ah + ef + min{a, /, k} 



v(H-) - 2 



< 1 + 



a+b+e+f 
a6 + e/ + min{^^,A;} 
a + h + e + f 
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Let X = (a + h)/2. We apply a standard "smoothing" technique: note that if we replace a = h = x, 
the denominator does not change, but the numerator increases by 



ab + mm ^ — ^ — , k 



a-b\\ . 1 ^ + / 1 . fa + Z 

+ mm < , K > — mm < , k 



a — b\ . ( b — a 



+ min|^^,0 f , 

which is > because a and b are integers. Similarly, letting y = (e + /)/2, we have an analogous result 
for e and /. Therefore, 

e{H',) ^ x^ + ,^+min{^,fe} ^ 



The bound is symmetric with respect to x and y, so we may assume without loss of generality that 
X < y- Our next claim is that increasing x to equal y only increases the bound further. That is, we 
aim to prove the following: 

^2 + y2 + k} ^ y'^ + y'^ + minjj/, k} 



2x + 2y - 2y + 2y 

Cross- multiplying to clear denominators, this is equivalent to 

4x^2/ + 4y^ + 4y min <^ -, k> < Axy"^ + 4y^ + (2x + 2y) min{y. A;}. 



Rearranging terms and using the substitution min{(x + y)/2, k} < min{y, k} (we assumed x < y), it 
suffices to show 

< (y - x)(4xy - 2min{y, fc}). (8) 

Yet X = {a + b)/2 and u G ?7i =^ a > 1, so we must have x > 1/2. This, along with y > x and 
min{y, fc} < y, establishes ([8]), and hence ([7]). Applying that to Q, we obtain 

< 1 + + min{y, k} 



v{H',) - 2 - 4y 

If y < fc, then min{y, A;} = y, so the bound simplifies into 1 + | + |, which is clearly increasing in y. 
Hence, we may increase y up to A; and only increase the bound further. Without loss of generality, we 
may now assume that y > k. 

Yet ii y > k, then min{y, k} = k, so we have the bound 



v{H'^) - 2 - 4y ' 

and we will show that it is increasing in y. Taking the derivative of the bound, we obtain 

d _1 k 
dy~2~ V' 
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which is > since we assumed that y > k, and k > 1. The maximum possible value of y = (e + /)/2 
is [t/2j, so we may increase y to this value and find 



v{H',) - 2 - 4 LfJ ■ 

The final bound is precisely e{Hk) / {v{Hk) — 2), so we are done. □ 

Proof of Lemma IB. 51 for Lt/2j < fe < 2Lt/2j. The proof in this regime is very similar to the 
previous one, but we write it out for completeness. Consider a consecutive pair {Hk, Hk+i). Let 
I = k — \ t/2\. By the construction, Hf^. has the following structure. The vertex set is partitioned into 
{Vi}^, with IV3I = IV4I = 1 and all other \Vi\ = [t/2\. The pairs (Vi, V2) and (V5, Vg) are each complete 
bipartite graphs, the vertex in V3 is adjacent to all of V2 U V4 U Vq, the vertex in V4 is adjacent to all 
of Vi U V3 U V5, there is a perfect [i/2j-edge matching between Vi and Vq, and there is an {£ — l)-edge 
matching between V5 and V2. There are no other edges. Also, there is a pair of vertices u € V5, 
u € V2, not involved in the {i — l)-edge matching, at which the addition of an edge creates Now 
consider any family of subsets Ui C Vi such that u & and v G U2- Let H'j^ be the subgraph of 
induced by UC/j. We must show that e{H',^)/{v{H'j^) - 2) < e{Hk)/{v{Hk) - 2). 

For brevity, let a = |?7i|, h = \U2\-, c = IC/3I, d = IC/4I, e = \U^\, and / = \Uq\. Since the edges 
between and U2 form a matching of at most I — \ edges which does not involve u £ 01 v U2, 
there can be at most min{6— 1, e — l,i—l} = mm{b, e,l} — l edges there. On the other hand, the edges 
between Ui and Uq also are a matching, and we may bound their number by min{a, /}. Therefore, 

e{Hl) ^ah + ef + c{b + /) + (a + e)d + cd + min{a, /} + (min{6, e, i} - 1) 



v{Hl)-2- a + b + c + d + e + f -2 

As in the proof of this lemma for k < \t/2\, our first step is to show that the bound is nondecreasing 
in c by calculating its partial derivative. The calculations are almost identical, and we arrive at 

d 

numerator of — = \(b + d + ff - 2(b + d + f) + l] + [af - mm{a, /}] + [eb - min{6, e,i}]. 
oc 

The first bracket is a perfect square, and each of the other brackets is nonnegative because all variables 
are nonnegative integers. Hence we may set c = 1 (its maximum value) and increase the bound. The 
same holds for d by symmetry. Therefore, 

e{H'j^) ^ a6 + e/ + (6 + /) + (a + e) + l + min{a,/} + (min{6,e,£}-l) 



v{H'^)-2 - a + b + l + l + e + f 

ab + ef + min{a, /} + min{6, e, i} 



= 1 + 
< 1 + 



a + b + e + f 
ab + ef + ^+mm{^,i} 



a + b + e + f 

We again will use a "smoothing" technique. Let x = {a + b)/2, and consider the effect of setting 
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a = b = X. The denominator will remain invariant, but the numerator will increase by 



a + bV ^ + f 



a+b 



+ min 



+ e 



a — b 



+ 



b — a 



+ 



mm 



— ,(> — min 



ab H h mm 



b + e 



b + e 
2 ' 



2 ' 



> 



a — b\ b — a 

H h mm 



a — 6 



+ 



b — a 



+ min 



2 

a — b 



-,o 



which is always > since a and 6 are integers. A similar results holds if we let y = (e + /)/2. Therefore, 



< 1 



+ j/' + ^ + min{^,4 
2x + 2y 



(9) 



The bound is symmetric with respect to x and y, so we may assume without loss of generality that 
X < y- Our next claim is that increasing x to equal y only increases the bound further. That is, we 
aim to prove the following: 



x2 + y2 + £+a 



+ min{^,£} ^ y2 _^y2 + min{y,£} 



< 



2x + 2y - 2y + 2y 

Cross- multiplying to clear denominators, this is equivalent to 

' x + y 



(10) 



4x y + Ay + 2y{x + y) + 4y min 



< Axy-^ + + (2x + 2y)y + {2x + 2y) min{y, ^}. 



Rearranging terms and using the substitution min{(a; + y)/2,£} < min{y,£} (we assumed x < y), it 
suffices to show 

< (y-a;)(4x?/-2min{y,^}). (11) 

Yet X = (a + b)/2 and v (z U2 ^ b > 1, so we must have x > 1/2. This, along with y > x and 
mm{y,£} < y, establishes p!T|) . and hence (fTUl) . Applying that to we obtain 

e(^^fc) < ^ V + y + min{2/, ^} 



4y 



If y < then min{y, i} = y and the bound becomes 1 + | + ^, which is clearly increasing in y. Hence, 
we may increase y up to £ and only increase the bound further. Without loss of generality, we may 
now assume that y > L 

Yet if y > £, then min{y, = so we have the bound 



viH'k)-2 



< 1 + 



2y^ + y + . 
Ay 



and we will show that it is increasing in y. Taking the derivative of the bound, we obtain 

d _l £ 
dy~2~ V' 
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which is > since we assumed that y > i, and £ > 1. The maximum possible value of y = (e + /)/2 
is [t/2j, so we may increase y to this value and find 



t 



v{H',)-2- 4y 
The final bound is precisely e{Hk) / {v{H]^) — 2), so we are done. □ 

Proof of Lemma IB. 51 for k > 2|_t/2j. Consider a consecutive pair (Hk, H^^i). By construction, 
Hk has the following structure. The vertex set is partitioned into {V^}]*, with IV3I = IV4I = 1 and all 
other \Vi\ = [t/2\ . The pairs (Vi, V2) and (V5, Vq) are each complete bipartite graphs, the vertex in V3 
is adjacent to all of V2 U V4 U Vg, the vertex in V4 is adjacent to all of Vi U V3 U V5, there is a perfect 
[t/2j-edge matching between Vi and Vq, and another perfect matching between V5 and V2. There 
may be some more edges as well between Vi and Vq or between V5 and V2, but not all such edges are 
present: without loss of generality, let us suppose that there are two vertices u £ Vi and v £ Vq such 
that there is no edge between u and v. There are no more edges in the entire graph. Also, H^+i is 
obtained from by adding the edge joining u and v. Now, consider any family of subsets Ui dVi 
such that u £ Ui and v £ Uq. Let H'j^ be the subgraph of Hk induced by UC/j. We must show that 
e{H',)/{v{H',) - 2) < e{Hk)/{v{Hk) - 2). 

For brevity, let a = \Ui\, b = IC/2I, c = |?73|, d = {U^l, e = {U^l, and / = \Uq\. Let E be the number 
of edges in between Ui and Uq or between U5 and U2- Then 

e(F^) _ ab + ef + c{b + f) + {a + e)d + cd + E 

[l2) 



v{H'^)-2 a + b + c + d + e + f -2 

Next, recall that contained a perfect [i/2j-edge matching between Vi and Vq, and between V5 and 
V2. The maximum number of edges of these matchings that are included in E (i.e., go between Ui 
and Uq, or between U5 and U2) is min{a, /} + min{6, e} < (a + / + 5 + e)/2. Therefore, the number of 
edges in between Vi and Vq or between V5 and V2 is at least -E + 2|_|J — ^±±+£±1. xhe rest of the 
edges in are easy to count: (Vi, V2) and (V5, Vq) are complete bipartite subgraphs -fi^[t/2j,[t/2j 1 the 
vertex in V3 is adjacent to all of V2 D V4L) Vq, and the vertex in V4 is adjacent to all of Vi U V3 U V5. 
Therefore 

eiHk) ^ 2[i\'+[A[i\+l] + [E + 2[i\-^-±^] _ ^^^^ 



v{Hk) - 2 - 2t - 2 

Identifying the denominator as 4[t/2j since t is odd, we may simplify the bound: 



e{Hk) ^ 1 



v{Hk) - 2 - 2 
and so it suffices to show that 



3 l + E- "+''+^+/ 
^ 2 ^ A\tj2\ ' 



ab + ef + c{b + f) + {a + e)d + cd + E ^ 1 
a + b + c + d + e + f -2 ~ 2 



3 l + E- ^+^+^+f 
^ 2 ^ A\t/2\ ' 



Observe that E appears in the numerator on both sides. Since the denominator on the left hand 
side is always less than or equal to the denominator on the right hand side, we may replace E 
(on both sides) by its maximum possible value and only sharpen the inequality. This maximum 
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is (iC/illf/gl ~ 1) + (|f^5||t^2|) = af — 1 + be, because we assumed that Hk had no edge between u € C/i 
and V (z Uq. Thus it suffices to show that 



ab + ef + c{b + f) + {a + e)d + cd + {af - I + be) ^ 1 



a + h + c + d + e + f -2 ~ 2 

Simplifying the numerator on each side, this is equivalent to 



3 1 + (g/ - 1 + 6e) - 
^ 2 ^ 4\t/2\ 



{a + c + e){b + d + f)-l ^ 1 
a + b + c + d + e + f -2 ~ 2 



3 (a + e)(6 + /)-(ab + e/)-^^±^ 
2 4[t/2j 



(14) 



Note that the variables c and d appear only on the left hand side. We will show that the left hand side 
is nondecreasing in each variable, which will then allow us to replace each of them by their maximum 
value of 1 and only sharpen the inequality. Let us verify this by taking the partial derivative of the 
left hand side with respect to c: 



d 



numerator of — 
oc 



c + e){b + d + f)-l] 



{a + b + c + d + e + f -2){b + d + f) -[( 
= ib + d+ f)^ -2{b + d + f) + l. 

This is a perfect square, so it is > as claimed. By symmetry, a similar result holds for d; therefore 



(a + c + e)(6 + d + /) - 1 ^ (a + 1 + e)(6 + 1 + /) - 1 



a + b + c + d + e + f -2 a + 6+l + l + e + / 
and applying this to ([E]), we see that it suffices to show 



1 + 



(a + e)(fe + /) 
a + b + e + f 



(a + e)(6 + /) ^ 1 
a + b + e + f - 2 



1 (fl + e)(6 + /)-(a6 + e/)-^^±^ 

2 ^ 4[t/2\ 



(15) 



Our next step is to apply a "smoothing" argument to push (a + e) and (6 + /) closer together while 
preserving their sum. We claim that this will only sharpen the inequality. By symmetry, we may 
assume without loss of generality that a + e < b + f . Let A = [(5 + /) — (a + e)]/4, and consider 
the effect on (I15p of increasing each of a and e by A, while decreasing each of b and / by A. This 
will bring (a + e) and (6 + /) together to their average, which by convexity, increases their product 
(a + e)(6 + /). This product appears in the numerator on both sides, and since the denominator on the 
left is at most the denominator on the right, this simultaneous change can only sharpen the inequality. 
The only other term that will change is ab + ef , so it remains to verify that it increases under this 
transformation: 



(a + A)(6 - A) + (e + A)(/ - A) = {ab + ef) + {b + f 



e)A - 2A2 = {ah + ef) + 2A^ 



This proves our claim that we may "smooth" (a + e) and {b + /) together and only sharpen the 
inequality. Therefore, we may now assume that a + e = b + f for the remainder of the proof; let 
x = {a + e)/2. 

Rewriting the right hand side of (jlSp . it suffices to show 



(a + e)(b + /) ^ 1 
a + b + e + f - 2 



1 af + be- ^±^+^ 



+ 2 + 



4Li/2j 
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Substituting a + e = 2x = b + f into the inequality, this is equivalent to 



(2x)(2x) ^ 1 
2x + 2x - 2 



1 af + be- 
^ 2 ^ 4[t72] ' 



Simplifying each side of the inequality, this is equivalent to 



1 

X < - 
- 2 



1 af + be — 2x , , 

+ 2 + 4Lt/2J ■ 



Observe that we always have 2x < 2[t/2j because < a,b,e, f < [t/2j; hence ^ + "-^^^^^^^^ > 0. 
Therefore, if x itself is already < ^ , then we are already done. 

It remains to consider the case when 2x > [t/2\, which we will assume for the remainder of this 
proof. Let us replace the af + be term by a function of x, so that the entire inequality will be in terms 
of X and t only. That is, we wish to find the minimum value of af + be when a,b,e,f £ [0, [t/2j], 
subject to the constraints a + e = 2x = b + f. Writing e = 2x — a and / = 2x — 6, we aim to minimize 

af + be = a{2x -b)+ b{2x - a) = 2x{a + b) - 2ab. 

For fixed a + b, the right hand side is minimized when a = b, so we may assume that is the case; let 
z = a = b. We are then trying to minimize 2x{2z) — 2z^ for fixed x. The variable z is constrained to 
the interval [2x — [t/2j, [t/2j], because < a < [t/2j and ^<e = 2x — a< [t/2j and we assumed 
2x > \t/2\. Since 2x{2z) — 2z^ is quadratic in z with negative leading coefficient, it is minimized at 
one of the endpoints of z's interval. It turns out that it takes that same value 2[t/2j(2x — [t/2j) at 
each endpoint, so we conclude that af + be> 2[t/2j(2x — [t/2j). 
Substituting this into (jl6p . it now suffices to show 



1 

x< - 
- 2 



1 2[t/2\{2x-[t/2\)-2x 

2 4[t/2j 



Rearranging terms, this is equivalent to 

2x 1 
4[t/2j - 2' 

which is true because x = (a + e)/2 and a,e < [t/2j , so we are done. □ 
B.2 Inequalities 

For the reader's convenience, we reproduce the definitions of s and 9: 

s = Liog.[(r - i)t + e= " 'It' , . 



r*(f2 - s) + 



r-l 



Inequality B.6. Suppose that t > 3 and r > 2. Then —9 > — f • 
Proof. Clearing the denominators, this is equivalent to 



r\2t^ - 2t) + 2t< r%2t^ - 2s) + 2- 
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Rearranging terms, this is equivalent to 

1 



2r'(s -t) + 2t <2- 



r - 1 

Since the right hand side is always positive, it suffices to show that the left hand side is < 0. Note 
that if s < t/2, then the left hand side is indeed < t{—r^ + 2) < since r > 2 and s > 1. This happens 
whenever t > 4 by Lemma IA.21 and whenever t = 3 and r > 3 because Lemma lA.ll shows that then 
s < [log3(2t + 1)J = 1 < t/2. The only remaining case is t = 3, r = 2, in which case s = 2, and one 
may manually verify that the left hand side is precisely — 2 < 0. □ 

Inequality B.7. For any t > 3 and r >2, if p ^ n^^ , then n^*^^^*^^'*^^ is a positive power of n. 

Proof. The proof of this inequality is nearly identical to the proof of Inequalitv lA.7[ Cross-multiplying 
to clear the denominators, it suffices to show 

r — 1 

The right hand side of (fT7|) is equal to 



{2t - 2) 



r%t^-s) + 



>it^-s-l)[r%2t-2) + 2]. (17) 



{t^ -s- 1) [r'{2t - 2) + 2] = {t^ - syi2t - 2) - r'{2t - 2) + {t^ - s - 1)2, 
so pT|) is equivalent to 

— 1 

{2t - 2) > -r%2t - 2) + (t^ - s - 1)2, 

r — 1 

and by rearranging terms and ignoring the s term appearing in the parentheses on the right hand side, 
it suffices to establish 

j.S + 1 _ 1 

(2t-2)- > (t2-l)2. 

r — 1 

Dividing through by 2{t — 1), we see that it suffices to show that 



>t + l, 



r — 1 

which was already shown at the end of the proof of Inequality I A. 71 □ 

Inequality B.8. For any t > 3 and r >2, if p <C n^^ , then 7^2t-2pt2-s ^ negative power of n. 

Proof. The proof of this inequality is nearly identical to the proof of Inequality IA.8I We must 
establish the following inequality: 

— 1 " 

(2t-2) -"^^2 ^ 

This is equivalent to 



r'{t' -s) + 



< {t^ - s) [r'{2t - 2) + 2] . 



r — 1 
r' - 1 



{2t-2) < (t^-s)2. 

r — 1 

By definition of s, r^ < {r — l)t + 1. Therefore, the left hand side is at most {2t — 2)t, and it suffices 
to show that 

2*2 -2t < 2*2 - 2s, 

i.e., to show that s < t. But Lemma I A. II established that if r > 2, then s < [log2(* + 1)J, and this is 
indeed less than t for all t > 3. □ 
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